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Abstract 

A  discrete-time  svstem  of  two  queues  served  according  to  distinct  policies  is  studied  in  this  paper;  one  queue  receives 
I  -limited  service;  the  other  is  served  according  to  some  policy  front  the  introduced  class  of  (state  dependent)  policies  5. 
Class  S  contains  traditional  service  policies  as  well  as  ad  hoc  policies.  A  partial  performance  ordering  of  policies  in  <S  is 
derived  which  can  be  useful  in  designing  ad  hoc  policies  with  improved  performance  compared  to  that  of  traditional  policies. 
Some  examples  of  ad  hoc  policies  in  S  arc  presented  and  numerical  results  are  derived  to  illustrate  the  potential  for  efficiency 
provided  by  service  policies  in  S. 

Keywords:  Discrete-time  queueing  systems;  M/C/ 1  type  Markov  chain;  Matrix  analytic  technique;  Service  strategies 


1.  Introduction 

Discrete-time  queueing  systems  have  widely  been  adopted  for  the  study  of  packet  communication  net¬ 
works,  where  packet  processes  are  described  in  terms  of  discrete-time  stochastic  point  processes.  The 
service  policies  associated  with  such  systems  model  the  protocols  that  govern  the  availability  of  network 
resources  to  the  packets,  users  or  classes  of  users.  The  distributed  nature  of  a  networking  system  and/or 
the  diversified  service  requirements  of  the  supported  users  necessitate  the  development  of  sophisticated 
resource  allocation  protocols  modeled  in  terms  of  queueing  systems  under  various  service  policies.  Exam¬ 
ples  are  the  exhaustive,  gated  and  limited  service  policies  which  have  been  studied  extensively,  primarily  in 
continuous  time,  in  the  context  of  polling  [1,2),  and  vacation  systems  [3 1.  and  the  consistent  gated/limited 
policy  [4)  which  has  been  applied  for  the  study  of  the  DQDB  network  [5,6]. 

*  Research  supported  in  part  by  the  National  Science  Foundation  under  gram  NCR-901  1 962  and  by  the  Advanced  Research 
Project  Agency  ( ARPA)  under  Grant  F49620-93- 1  -0564  monitored  by  the  Air  Force  Office  of  Scientific  Research  (AFOSR). 
This  work  was  conducted  while  the  authors  were  with  the  Department  of  Computer  Science  and  Electrical  Enuineerine  of  the 
University  of  Vermont.  Burlington.  VT  05405.  USA. 

*  Corresponding  author.  Tel.;  (617)  373-3053;  fax;  (617)  373-8970. 
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A  discrete-time  single  server  queueing  system  consisting  of  two  queues  is  considered  in  this  paper.  One  of 
the  queues  receives  1  -limited  service,  the  other  is  served  according  to  a  policy  contained  in  a  versatile  class 
of  state-dependent  service  policies  5.  This  class  contains  some  of  the  well-known  service  policies  as  well 
as  new  ad  hoc  policies  which  may  provide  for  improved  performance  compared  to  that  under  traditional 

policies. 

The  system  studied  in  this  paper  may  be  considered  as  a  variant  of  a  two-queue  asymmetric  polling  system 
with  1 -limited  service  at  one  queue  and  service  according  to  a  policy  in  5  at  the  other.  When  both  queues 
receive  1 -limited  service  the  system  operates  under  an  alternating  service  policy,  which  has  been  analyzed 
in  continuous  time  [7],  A  continuous-time  two-queue  model  with  mixed  exhaustive  and  limited  services 
has  been  studied  in  [8].  A  discrete-time  model  with  gated  and  1 -limited  services  has  been  analyzed  in  [4], 

The  system  considered  in  this  paper  may  also  be  described  in  terms  of  a  GI/D/ 1  infinite  queue  with  server 
vacations.  The  vacation  period  distribution  depends  on  the  occupancy  of  the  queue  which  is  served  during 
the  vacation.  The  queueins  model  and  analysis  developed  in  this  paper  are  also  applicable  to  a  system  w  ith 
vacation  period  distributions  that  depend  on  the  state  of  a  Markov  process  which  evolves  independently 
of  the  system  [9).  This  state-dependent  vacation  model  is  different  from  most  of  those  presented  in  the 
past,  for  example  (10.1 1  ].  in  the  following  aspect:  the  process  whose  state  determines  the  occurrence  and 
distribution  of  vacation  periods  is  external  to  the  queue  under  study.  The  analysis  is  based  on  matrix  analytic 

techniques  similar  to  those  applied  for  the  study  of  the  vacation  model  in  [  12]. 

The  detailed  description  of  the  queueing  system  and  a  unified  representation  of  the  class  of  service  policies 
S  are  presented  in  Section  2.  In  Section  3.  a  queueing  model  for  the  system  is  formulated  and  analyzed. 
The  joint  probability  distribution  of  the  queue  occupancies  is  derived  by  applying  matrix  analytic  methods 
and  Markov  renewal  theory  arguments.  Section  4  presents  a  partial  performance  ordering  for  policies  in  5. 
Some  numerical  results  which  illustrate  the  effect  of  various  service  disciplines  on  the  performance  of  the 
system  are  presented  in  Section  5.  The  work  is  summarized  in  Section  6. 


2.  Description  of  the  queueing  system 
2.1.  Introduction 

Consider  the  discrete-time  queueing  system  shown  in  Fig.  1.  The  packet  service  time  is  assumed  to  be 
constant  and  equal  to  the  system  time  unit  (slot).  Unless  otherwise  stated,  a  superscript  I  (F)  will  indicate  a 
quantity  associated  with  queue  Q[  (QF).  The  capacities  of  queues  QF  and  Ql  are  assumed  to  be  N  <  oo 
and  infinite,  respectively.  Let  {^};>o  and  (^F];>o  denote  the  associated  queue  occupancy  processes  with 

state  spaces  /?’  =  (0.  1.2... .)  and  /?F  =  (0,  1 . N], 

The  system  service  discipline  is  determined  by  the  general  rules  presented  in  Section  2.2  and  the  adopted 
service  policy  in  5  presented  in  Section  2.3.  A  brief  introduction  to  the  service  discipline  is  presented 

first.  .  F 

The  server  never  idles  as  long  as  there  is  service  to  be  performed.  Upon  switching  to  Q  .  the  server 

serves  a  number  of  F-packets  (packets  in  QF)  which  depends  on  the  number  of  packets  found  in  Q  and 
the  number  of  packets  left  in  Q '.  If  the  number  of  F-packets  served  cannot  exceed  the  number  of  F-packets 
already  present  in  QF  at  the  switching  instant,  then  this  state-policy  (see  Section  2.3)  wilt  be  called  gated. 
Otherwise,  it  will  be  called  non-gated.  If  all  state-policies  of  a  service  policy  are  gated  (non-gated)  the 
service  policy  will  also  be  called  gated  (non-gated).  Following  the  service  to  QF  the  server  switches  to  Q 
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Fig.  L  The  queueing  system. 


and  provides  1-limited  service  to  that  queue.  Although  a  service  policy  other  than  the  1-limited  could  be 
considered  for  Q\  the  analysis  developed  in  this  work  is  applicable  only  under  1 -limited  service  to  Ql. 

2.2.  The  general  sendee  mechanism 

The  system  is  work  conserving;  that  is.  the  server  is  never  idle  when  the  system  is  non-empty.  The  server 
is  assumed  to  be  at  QF  when  idle.  Let  a  decision  time-instant  rd  be  defined  as  the  time  instant  at  which  a 
packet  is  forwarded  to  the  head  of  Q1;  the  collection  of  decision  time-instants  is  denoted  by  .4. 

(i)  At  /d,  td  e  A  the  server  switches  to  QF\  the  amount  of  service  provided  to  this  queue  is  determined 
by  the  adopted  policy  in  5  (Section  2.3).  Upon  completion  of  this  service  to  (?F — which  can  possibly 
be  equal  to  0 — the  server  switches  to  Q{  and  serves  one  packet. 

(ii)  Upon  completion  of  the  1-limited  (packet)  service  to  Ql: 

(a)  If  Q{  is  left  non-empty,  a  packet  is  instantaneously  forwarded  to  the  head  of  Ql  and,  thus,  a  decision 
time-instant  ;d  is  reached. 

(b)  If  Q 1  is  left  empty,  QF  is  served  uninterruptedly  until  the  next  decision  time-instant  ;d,  to  be 
determined  by  the  first  future  packet  arrival  to  Ql. 

Notice  that  before  service  is  provided  to  a  packet  forwarded  to  the  head  of  Q\  an  amount  of  service 
is  provided  to  QF ,  as  determined  by  a  policy  in  S.  In  general,  the  amount  of  service  provided  to  QF — 
determined  by  the  policy  in  S — will  depend  on  the  state  (queue  occupancy)  of  QF .  Thus,  in  general,  the 
server  behavior  is  ‘regulated’  by  the  state  of  QF.  Since  the  server  considers  switching  back  to  QF  at  any 
(discrete)  time  instant  at  which  it  is  aw’ay  from  this  queue,  QF  may  be  viewed  as  the  critical  queue.  Although 
almost  the  entirety  of  the  adopted  service  disciplines  for  QF  would  provide  prioritized  service  to  this  queue, 
there  is  at  least  one  such  service  discipline  under  which  the  priority  is  reversed. 

2.  J,  The  class  of  sendee  policies  S 

As  will  become  clear,  a  number  of  traditional  service  disciplines,  as  well  as  ad  hoc  policies — potentially 
improving  the  performance  achieved  by  traditional  ones — can  be  represented  by  selecting  a  proper  service 
discipline  in  S. 

Let  /,  0  <  /  <  A\  denote  the  state  of  {<7F};>o  at  some  decision  time-instant  rd,  rd  e  A.  A  state-policy 
associated  with  state  i.  P,\  is  defined  to  be  equal  to  the  potential  amount  of  service  (in  packets)  provided 
to  Qf  by  the  server,  immediately  after  rd.  0  <  Vj  <  oo.  This  amount  of  service  will  be  provided  to  QF 
unless  the  queue  becomes  empty.  In  view  of  rule  (i),  the  definition  of  Vi  and  the  work-conserving  nature  of 
the  service  discipline,  the  server  will  switch  to  Ql  at  /d  +  r (i).  where  r(/)  =  min(P,*,  rF(/)}  and  ;d  -WF(/) 


18 


/.  Stavrakakis.  5.  Tsakiridou/ Performance  Evaluation  29  ( 1997 )  15-33 


(P ) 


(M 


Fig.  2.  The  space  of  state-policies  SV  (each  dot  marks  a  state-policy  point). 


denotes  the  first  time-instant  following  /d  at  which  Qf  becomes  empty:  r (cj^)  will  be  referred  to  as  the 
service  horizon  following  /d  or  determined  by  <7^. 

A  service  policy  in  5  will  be  defined  in  terms  of  the  state-policies  V,  associated  with  all  states  /.  0  < 
i  <  N .  Since  the  service  policies  considered  here  are  work-conserving,  it  is  easily  established  that  any 
selection  for  Pq  will  induce  identical  server  behavior  as  Pq  =  0.  Pq  =  0  will  be  assumed  for  any  policy  in 
5  and,  thus,  a  policy  in  S  will  be  defined  in  terms  of  P, .  1  <  i  <  /V.  A  service  policy  in  S  will  be  uniquely 

represented  by  the  /V-dimensional  vector  V  =  (V\ ,  P: . P,v).  P  €  S  =  2q  x  Zq  x  •••  x  Zj.  where 

Zq  denotes  the  set  of  non-negative  integers.  As  will  become  clear  below,  this  vector  representation  of  a 
service  policy  in  terms  of  the  state-policies  (P/)-=|  not  only  provides  tor  a  unified  description  (and  study) 
of  many  well  known  service  policies,  but  also  allows  for  the  introduction  of  new  ad  hoc  policies.  The  space 
of  state-policies  SV  is  shown  in  Fig.  2.  A  service  policy  P  e  S  may  be  graphically  represented  by  the  set 
of  state-policy  points  {(/,  P):  1  <  /  <  N}.  or  the  lines  connecting  these  points  ( policy-lines )  on  SV. 

The  following  definitions  will  provide  for  a  classification  of  state-policies  and  service  policies  in  S. 

Definition  1.  A  state-policy  P/  will  be  called  gated  if  only  packets  that  are  present  in  QF  at  some  rd  e  A  at 
which  q ^  =  i  will  be  served  over  the  service  horizon  following  rd;  otherwise,  it  will  be  called  non-gated. 

It  will  be  called  unlimited  (limited)  gated  if  all  (not  all)  packets  that  are  present  in  QF  at  some  rd  €  A  at 
which  <7^  =  i  will  be  served  over  the  service  horizon  following  rd.  It  will  be  called  unlimited  non-gated  if 

exhaustive  service  is  provided  to  QF :  otherwise,  it  will  be  called  limited  non-gated. 

Definition  2.  A  service  policy  P  =  (Pi .  Vi . Vs)  €  5  will  be  called  (limited  or  unlimited)  gated  it  all 

associated  state-policies  are  (limited  or  unlimited)  gated.  It  will  be  called  ( limited  or  unlimited)  non-gated 
if  all  associated  state-policies  are  (limited  or  unlimited)  non-gated. 
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Based  on  Definition  1.  it  can  be  easily  established  that  Vt  is  gated  if  Vi  <  i .  In  this  case.  Vi  packets  will 
be  served  over  the  service  horizon  r  (/ ),  and  thus,  Pr{r(z)  =  Vt  \  =  1.  Vi  is  unlimited  gated  if  Vi  =  /  and 
limited  gated  if  Vi  <  i.  Similarly,  V{  is  non-gated  if  V\  >  i .  In  this  case,  at  least  i  packets  will  be  served 
over  the  service  horizon  r(i)  and  Pr{ r (/ )  =  Vf  ]  <  1.  Vi  is  unlimited  non-gated  if  Vi  =  cc  and  limited 
non-gated  if  Vi  <  cc. 

If  V  is  gated,  the  amount  of  service  provided  to  QF  over  any  service  horizon  r (/ )  is  equal  to  Vl%  for  all 
L  I  <  /  <  /V,  thus;  Pr{  r  (/ )  =  Vi  \  =  1.  The  subclass  of  gated  service  policies  Sc  is  defined  on  the  subspace 
SVg  =  {(/.  *P/):0  <  Vi  <  /.  0  <  /  <  /V}  of  SV. 

If  V  is  non-gated,  the  amount  of  service  provided  to  QF  over  a  service  horizon  z(i)  is  at  least  equal  to  Vt 
and  depends  on  the  evolution  of  (<7;F}y>o  over  that  service  horizon:  thus,  Pr{ r (/ )  =  Vi)  <  1.  The  subclass 
of  non-gated  service  policies  Ssg  is  defined  on  the  subspace  SVsg  =  {0*.  V, ):  Vt  >  /.  0  <  /  <  /V}  of  SV. 

Based  on  the  above  definitions,  the  space  of  state-policies  SV  can  be  divided  into  the  subspaces  of  gated. 
SVg .  and  non-gated,  SVsg .  state-policies;  a  policy-line  in  SV c  (SVsg)  defines  a  gated  (non-gated) 
service  policy  in  S.  The  boundary  between  the  two  subspaces  identifies  the  diagonal  policy-line  (g)  (Fig.  2) 
which  represents  the  unlimited  gated  service  policy.  The  following  definition  provides  for  another  classifi¬ 
cation  of  service  policies  in  S. 

Definition  3.  A  service  policy  V  is  ^-limited,  if  V{  =  k.  1  <  /  <  yV,  for  some  k.  0  <  k  <  cc.  Note  that  a 
A-limited  service  policy  is  not  state-dependent.  This  is  also  indicated  by  the  graphical  representation  of  the 
^-limited  service  policies  in  SV\  each  A'-limited  service  policy.  0  <  k  <  cc.  is  represented  by  a  horizontal 
policy-line  (/*)  (Fig.  2). 

The  state-policies  V,  associated  with  a  ^.'-limited  service  policy  V  may  be  gated  (/  >  k)  or  non-gated 
(/  <  k).  For  the  extreme  values  of  k .  k  =  0  and  k  =  cc.  the  system  service  policy  becomes  HoL  priority 
for  queues  Q[  and  respectively.  For  k  —  1,  the  system  operates  under  an  alternating  service  policy;  a 
single  packet  is  served  at  each  queue,  in  a  alternating  manner,  while  both  queues  are  non-empty. 


2.4 .  An  expansion  of  class  S.  5exp 

Based  on  the  definition  of  a  state-policy  stated  at  the  beginning  of  Section  2.3,  the  potential  amount  of 
service  over  any  service  horizon  r(/)«  for  any  policy  V  €  S  described  so  far,  is  deterministic  and  equal 
to  Vi.  A  generalized  definition  of  a  state-policy  is  introduced  below  to  describe  probabilistically  limited 
service  policies  [13]  and  expand  the  class  of  service  policies  in  S.  A  probabilistically  limited  state-policy 
associated  with  state  /,  1  <  /  <  /V,  may  be  defined  in  terms  of  the  probability  vector  gt  =  [gioSn 
where  g/y,  1  <  /  <  N.  j  >  0,  denotes  the  probability  that  the  potential  amount  of  service  provided  to  QF 
over  a  service  horizon  r  (/)  is  equal  to  J.  A  service  policy  in  the  expanded  class  of  service  policies  •Sesp 
may  be  represented  by  a  matrix  V:  V  =  [^|  §2  Si  ■  •  ■  SV  1- 

Clearly.  S  C  5exp.  A  service  policy  V  €  5  can  be  expressed  in  terms  of  a  matrix  V  by  assigning  the 
proper  distributions  to  every  (deterministic)  state-policy  /: 

II  for  j  —  Vi .  ,  .  ,, 

S''  =  |o  otherwise.  1  5  ^  ,V- 


I 


20 


/.  Stavrakakis.  S.  Tsakiridou/ Performance  Evaluation  29  ( 1997)  15-33 


;W -  \ - 

f - *(q,«) - 


_j _ i — i — i, — i — i — i 

i  >k 

_ _ i — ». 

time  (slots) 

i) 

;  * 

r —  \ — 

<  1  i _ i — i — 

*7* - t(q  j)  ^ 

_ _ i _ _ 

|  tk  ,  |  \  lime  (slots) 


ii) 


Fi^  3.  Illustration  of  the  definitions  hk  and  dk :  (i)  hk  =  0,  (ii)  hk  >  0.  An  arrow  pointing  to  (away  from)  the  time  axis 
©*  . 

indicates  the  movement  of  a  packet  to  (away  from)  the  head  of  Q  . 


A  probabilistically  limited  service  policy  is  gated  if  H'_0S<7  =  1  and  non-gated  if  gij  =  I. 

\  <  i  <  N.  Examples  of  probabilistically  limited  service  policies  are  the  Bernoulli  gated: 


gij  = 


1  -  pi.  j  =  0, 

Pi.  j  =  1 . 

0.  otherwise. 


1  <  i  <  N. 


and  the  binomial  gated: 


gij  = 


(*.)  pj  (•  “  Pi)'  1  .  0  <  j  <i. 
0,  otherwise. 


1  <i<N. 


where  0  <  p,  <  1 ,  1  <  /  <  /V,  and  ('.)  is  the  binomial  coefficient. 

For  pi  =  p2  =  •••  =  p,v  =  1  the  Bernoulli  gated  policy  becomes  1-limited  and  the  binomial  gated 
service  policy  becomes  unlimited  gated.  For  pi  =  P2  =  •  •  •  =  p,v  =  0  both  policies  become  HoL  priority 
at  Q}.  In  the  rest  of  this  paper,  only  service  policies  in  S  will  be  considered. 


3.  Modeling  and  analysis  of  the  queueing  system 

The  queueing  behavior  of  the  system  under  the  adopted  service  discipline  is  analyzed  in  this  section 
under  i.i.d.  and  mutually  independent  arrival  processes  (oj(;>o  and  |aJ}y>o  to  QF  and  Q  ,  respectively. 
The  joint  probability  distribution  of  the  occupancy  at  queues  Q}  and  QF  upon  service  completion  of  a 
packet  at  Ql  is  derived  first,  based  on  a  matrix  analytic  approach.  The  joint  probability  distribution  of  the 
queue  occupancies  at  arbitrary  time  instants  is  then  obtained  by  applying  Markov  renewal  theory  arguments. 
Consider  the  following  time  sequences  defined  at  the  end  of  slots: 

Uk U>o  :  Defined  to  be  the  sequence  of  time  instants  at  which  the  service  of  a  packet  is  completed  at 

(21  (/o  =  0). 

{/j?)*>i  :  Defined  to  be  the  sequence  of  decision  time-instants,  or  time  instants  at  which  a  packet  is 
forwarded  to  the  head  of  )*>i  =  -4). 
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According  to  the  system  service  discipline  rules  described  in  Section  2.2,  the  server  switches  to  Q ^  at 
ik- 1  for  an  amount  of  time  which  depends  on  qjk  (  and  qFd: 

W  0.  by  rule  (ii(a)).  the  kih  packet  is  instantaneously  forwarded  to  the  head  of  Ql.  and  thus, 

r*-l  =  3(i)).  In  this  case,  the  server  will  remain  at  QF  for  r(<7^)  slots. 

(")  If  9/4_i  =  °-  b-v  11112  (*i(b)).  Qx  will  remain  empty  for  a  time  interval  hk.  until  the  first  future  packet 

amval  to  Ql ;  at  r*_i  +  hk  the  decision  time-instant  r;d  is  reached  (Fig.  3(ii)).  In  this  case,  the  server 
will  remain  at  Qp  for  hk  +  r  (<7  j )  slots. 

'k 

The  service  of  the  £th  packet  at  Q1  will  be  completed  at  time  instant  tk  =  rk  +  r(qFd)  +  1  and,  therefore. 

the  time  interval  dk  between  the  successive  service  completions  of  the  (k  -  l)st  and  Jtth  packets  at  Ql  is 
given  by  dk  =  tk-\  -  tk  =  hk  +  r(^J)  +  1  where  hk  =  0.  for  qjki  >  0  (Fig.  3).  It  will  be  shown  in  the 

following  that  the  process  { qjt.qfk)k>0  is  a  Markov  chain  embedded  at  service  completion  time  instants 
(U-k>o-  In  order  to  proceed  with  the  analysis  the  following  random  variables  need  to  be  defined: 

a1  (nF)  :  Denotes  the  number  of  packet  arrivals  to  Ql  (QF)  in  an  arbitrary  slot.  The  probability  distri¬ 
bution  of  a  (nF)  is  denoted  by  p[(n)  (pF(n)).  n  >  0. 

a11  :  Denotes  the  number  of  packet  arrivals  to  Ql  over  /  slots.  It  is  distributed  according  to  pli‘(n), 

the  /-fold  convolution  of  p[(n).  n  >  0  (/?'• 1  (/i)  =  p}t(n)). 

h  ■  Denotes  the  generic  random  variable  for  hk  \  it  describes  the  time  between  the  service  com¬ 
pletion  of  a  packet  that  leaves  Qx  empty  and  the  first  packet  arrival  to  the  empty  queue.  The 
distribution  of  random  variable  h  is  given  by  ph{n)  =  [pj,(0)]"-l[l  -  p,',(0)],n  >  |. 

Proposition  1.  The  process  {qjk .  qF }*  >0  is  a  two-dimensional  Markov  chain  embedded  on  packet  sen-ice 
completion  time  instants ,  {fjt  }jt> o- 

Proof.  It  suffices  to  show  that  qjk  and  qfk  can  probabilistically  be  determined  from  qjk  (  and  qfk  The 

evolution  of  the  occupancy  process  {qj)j>Q  is  determined  based  on  the  server  availability  to  £)F;  transitions 

take  place  at  the  end  of  slots.  If  the  server  is  at  Q?  at  the  end  of  a  slot,  then  process  [q? }  ;>0  makes  a  transition 
according  to  the  probability  matrix  J 
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If  the  server  is  absent  from  QF  at  the  end  of  a  slot,  that  is.  at  time  instants  {r*k>0  transitions  take  place 
according  to  the  probability  matrix  Pv. 

4(b>  4u)  4(2)  •••  4('v-d  E 

n  —  S 
cc 

o  4(0)  4d)  •••  4(tf - 2)  E  4(«> 

n  =  /V  —  I 

(2) 

o  o  4(°)  •••  4(/V_3)  E  4(") 

/i=,V— 2 


Therefore,  the  /-step  transition  probability  PrJr/J+z  —  i  1 4  —  / }  °f  the  process  over  an  inierva!  (;.  j  +  /| 
of  uninterrupted  service  to  Q?  of  length  /  (slots)  is  given  by  /*J(/)  =  (Pj'./  >  1. Given  (r/,l_l-^.,)-(/,d 

is  determined  by  4_,  and  the  transition  matrix  ^(/u)-  where  /i*  =  0.  if  4_t  >  0  and  distnbuted  as 
random  variable  h  otherwise:  since  /;•  =  t£  4-  r(<7^)  +  1. 4  is  determined  by  r/f j  and  the  transition  matrix 

PF(r(4d))/>,.  Since  q]  =  4_,  +  au‘  -  I  and  dk  =  hk  +  r(q*)  +  1.  <  is  determined  by  <4-r4-.>- 

^  't  .  * 

in  view  of  the  above  discussion.  □ 

By  identifying  q*  and  qf.  as  the  level  and  phase  of  process  (4 -4  >*><>•  respectively,  the  transition 
probability  matrix  P{q i„f,  of  this  embedded  Markov  chain  can  be  written  as  a  stochastic  matrix  ot  M/G/ 1 


probability 

type: 


'Bq 

B\ 

B2 

Bl 

Bj, 

^0 

^1 

a2 

••• 

0 

Ao 

A\ 

/A  2 

^3  ••• 

0 

0 

Aq 

A| 

A2 

where  A„  and  B„ .  n  >  0.  are  the  (N  +  l  )-dimensional  matrices  with  elements  A„ (j\  •  h) and  B„(j\  ■  72 )•  j\ • 

h e  pf>  s'ven  by 

An(j\< h)  =  Pr (4  =  »i  -  1  +  «-4  =  y'2l4-i  =  -M •  11  -  *•  (4) 

B„(j\ .  ji)  =  Pr  1 4  =  "•  4  =  h  1 4-1  =  °-  4-1  =  -M  •  (5) 

Note  that  is  the  probability  that  the  process  (4-4^i°  makes  a  lrans,tl0n  from  state  (0.  j\) 

to  state  (n.  p)  over  the  time  interval  between  two  successive  packet  service  completions  at  Q  .  An(j\.  jj) 
is  the  probability  that  over  the  time  interval  between  two  successive  service  completions  at  Q  ,  the  phase 
process  (4 }*>0  makes  a  transition  from  state  j\  to  state  j2.  and  there  are  n  packet  arrivals  to  Q  .  given  that 
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Q1  is  non-empty  at  the  beginning  of  the  transition.  Let  An(j |.  jz-  1)  (Bn(j\.  j-±,  /))  be  defined  similarly  to 
An{j\.  jz)  ( Bn(j\ .  jz))  but  under  the  additional  requirement  that  the  duration  of  a  transition  <4  is  equal  to 
/  slots.  /  >  1: 

A„(j\.j2-D  —  Pr  {<7/.  =  *l  “  1  =  jz>dk  =  1 1  qfi_l  =  j\ }  .  t,  >  I.  (6) 

B„Oi .  jz-  I )  =  Pr  (4  =n.qfk=  jz.  dk-l  \  <7,*.,  =  0.  =  y'( )  .  (7) 

Let  p^(i.j.n)  =  Pr{</F+j|  =  j.  r(<7F)  =  n  |<7F  =  /}.  i.j  €  /?F.  0  <  n  <  V,.  denote  the  transition 
probabilities  of  {<7F|/>o  over  a  service  horizon  of  length  n  slots.  These  are  determined  by  the  adopted 

service  policy  V  and  the  associated  state-policies  V-,.  i  €  /?F.  and  can  be  expressed  in  terms  of  the  one-step 
transition  probabilities  of  {^F)y>o  given  by  probability  matrix  Ps  [14|.  The  stochastic  matrix  with  elements 

pF(t.  j.  n)  is  denoted  by  Px(n).  It  can  be  easily  shown  that 


,v 

A„(j\.  jz.l)  =  p!;'(n)J^pPxU\- j-1  -  \)Pv(j- jz)-  I>  1.  .  (8) 

j=0 

l- 1  .V  ,14-1 

B„u\-jz-i)=  ZUm1"-'  2>Jow.««>  ->  +  n 

m  =  l  ;=  0  1  =  1 

,v 

x  72  Pr(7-  j’.l-m-  DpAj'.  jz).  I  >  2.  (9) 

;'= o 

B„{j\-jz-  D=0. 

From  (8)  and  (9)  matrices  An(!)  and  Bfi(l),  l  >  I,  may  be  written  as 


A„(l)  =  pl;‘(n)PPtd  -  \)PV. 

(10) 

/—I  /f-rl 

B„d)  =  Y:[p\,mm-Xrldn)  72  p'(ld)An-i+\d  -  /«). 

mas  l  1  =  1 

tf„(D  =  0. 

(11) 

Application  of  the  law  of  total  probability  yields 


rc  cc 

=  X>„(/)  =  T,p«l{n)ptV- d/v  (,2) 

/=i  / = I 

CC  /i-M 

Bn  =  72b„(I)  =  [l-  p'a(0)Psr'Ps  £  PlaO)A, (13) 
/=!  /=! 

The  stationary  joint  probabilities  for  the  queue  occupancies  upon  packet  service  completions  at  Q}%  y(t,  j). 
i  €  R\  j  €  as  determined  by  the  transition  probability  matrix  P{q\  are  derived  by  applying  matrix 
analytic  techniques  [  1 4 1.  Thestationary  joint  probabilities  at  arbitrary  time  instants,  j),  i  e  Rl.j  €  /?F 
are  then  obtained  in  terms  of  y(/.  j)  by  invoking  Markov  renewal  theory  arguments  1 1 4 1 . 
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4.  Partial  performance  ordering  of  policies  in  S 

In  view  of  the  structure  of  class  S  it  is  possible  to  establish  a  partial  performance  ordering  of  policies  in 

5,  as  shown  below. 

Definition  4.  Consider  service  policies  X  =  (Xj . X,v}and4?  =  {Q . f?,v|  in  5.  Let  qk  '  denote 

the  occupancy  of  at  time  slot  k  under  policy  V.  M  €  (I.  F),  V  €  {f?.  X}.  Policy  4?  is  said  to  outperform 
policy  X  with  respect  to  Q,u  if  and  only  if  <  q1^'* .  M  €  {I,  F},  for  all  k  >  0.  That  is, 


-  M  - 

E<n 


M  €  (I,  F). 


(14) 


Notice  that  (14)  implies  that  if  E<&,  then 

Lf^  <  L?^  and  D1^  >  Dl*.  (15) 

where  LF-P  denotes  the  packet  loss  probability  at  Q F  under  policy  V  and  Dl,p  denotes  the  average  packet 
delay  at  Ql  under  policy  V,  V  €  (X,  Q).  The  following  proposition  provides  for  a  partial  performance 
ordering  of  policies  in  5;  its  proof  may  be  found  in  Appendix  A. 


Proposition  2.  Consider  two  policies  X  =  (X| . X,v}  and  £2  =  [£2\ . in  S.  Then 

X,  <  min  (4?*)  for  all  /,  1  <  /  <  N.  (16) 

I  <Jt</V 

implies  that 


E<Q  and  Q<E. 


(17) 


Corollary  1.  Let  =  {Jt.Jt . k]  denote  the  k-limited  service  policy  in  5,0  <k<  cc.  Then  Proposition 

2  implies  that 


if  k  <  j  then  and  Cj<£k.  (18) 

Under  some  conditions  on  the  arrival  process  to  Q F  a  weaker  than  (16)  sufficient  condition  tor  perfor¬ 
mance  ordering  of  policies  in  5  may  be  obtained  as  described  in  Proposition  3. 

Proposition3.  Consider  two  policies  X  =  { X" i . X,y}  and  Q  =  (i?j . f?,v}  in  S.  Let  a^{At) 

denote  the  number  of  packet  arrivals  to  QF  over  an  interx'al  of  length  At  (slots).  If 

af{At)  <  At  (19) 

for  all  At,  At  e  (1,2,.. and 

X/  <  for  all  i  >  k  and  k ,  I  <  k  <  N,  (20) 

then 


-  F  -  -  I  - 

E<Q  and  £2<E. 


(21) 
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Fig.  4.  Policy-line  representation  of  the  set  of  policies  in  Example  1 . 

The  proof  of  Proposition  3  may  be  found  in  Appendix  A. 

When  the  arrrival  process  to  QF  is  Bernoulli,  condition  ( 19)  in  Proposition  3  holds  and  thus,  the  following 
corollary  is  evident  in  view  of  Proposition  3. 

Corollary  2.  Consider  two  policies  Z  =  (Tj . Z\)  and  Q  =  (f?( . in  5.  Let  the  packet 

arrival  process  to  QF  be  a  Bernoulli  process .  If 

Z{  <  for  all  i  >  k.  1  <  k  <  N,  (22) 

then 

Z<f>  and  £2<Z.  (23) 


5.  Examples,  numerical  results  and  discussion 


The  HoL  priority  policy  for  Q?  (upper  boundary  of  the  state-policy  space  SV)  minimizes  packet 
delay/loss  for  this  queue.  Similarly,  the  HoL  priority  policy  for  Q 1  (lower  boundary  of  the  state-policy 
space  SV)  minimizes  packet  delay  for  this  queue.  A  (state-dependent)  service  policy  in  S  is  expected  to 
provide  greater  flexibility  in  meeting  performance  requirements  associated  with  each  queue.  The  vector 
representation  of  a  service  policy  in  S  allows  for  the  description  of  ad  hoc.  customized  policies  as  illustrated 
in  the  following  examples. 

Example  1.  The  service  policy  V  defined  by  the  following  state-policies  Vi, 


Vi  = 


1  <  /  <  M, 
M  <  i  <  N, 


1  <  M  <  N, 


provides  unlimited  gated  service  to  Qf  when  the  content  (state)  of  Qf  is  below  some  threshold  /V/,  1  < 
M  5  'V*  and  A/-limited  service  otherwise  (Fig.  4).  This  policy  provides  increased  service  to  Ql  compared 
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Fig.  5.  Policy-line  representation  of  the  set  of  policies  in  Example  2. 


Fig.  6.  Policy-line  representation  of  the  set  of  policies  in  Example  3. 

to  the  unlimited  gated  policy  when  the  content  of  QF  exceeds  the  threshold  M\  it  also  provides  increased 
service  to  compared  to  the  /V/-limited  policy  for  I  <  /  <  M.  The  policy  becomes  I -limited  tor  M  =  I 
and  unlimited  gated  for  M  =  N. 

Example  2.  Consider  the  policies  L which  are  ^-limited  up  to  a  certain  state  /V/,  l  <  M  <  N,  and 
then  drop  linearly  with  fixed  slope,  as  shown  in  Fig.  5.  If  packet  arrivals  to  QF  satisfy  the  condition 
in  Proposition  3,  for  instance,  Bernoulli  arrivals,  then  the  following  performance  ordering  holds  true: 

Lk'*Lk'm  and  Lk'’X<Lk'  if  Mi  <  Mi. 

Example  3.  Consider  the  following  policies  (Fig.  6): 

f>\\  Vi  —  oo.  1  <  /  <  N  (unlimited  non-gated  or  oc-limited  or  HoL  for  Q*)\ 

Vi:  Vj  =  i,  \  <  i  <  N  (unlimited  gated); 

-py.  Vj  =  1.  1  <  /  <  N  (1 -limited); 
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V+:  Vi  =  0,  1  <  /  <  T  and  V\  =  /  —  T.  T  <  /  <  /V;  according  this  service  policy  (71  receives  HoL 
priority  service  whenever  the  content  of  QF  is  less  than  a  threshold  7.  If  the  content  i  of  QF  at  a 
decision  time-instant  exceeds  the  value  of  the  ‘safety’  threshold  7.  service  is  provided  to  i-T  packets 
at  Qf.  Compared  to  the  HoL  priority  service  policy  for  Ql.  this  ‘mixed’  policy  provides  increased 
service  to  the  finite  queue  when  it  is  critically  congested  (/  >  7)  and  is  expected  to  decrease  the  loss 
probability. 

Vs'.  Vi  =  0.  1  <  i  <  N  (0-iimited  or  HoL  for  £>'). 

In  view  of  Proposition  1.  the  following  performance  ordering  of  the  policies  may  be  easily  established: 

Vs<Vs<V2<V\  and  Vs<V4-<V\. 

V\<Vi<Vy<'P$  and  V\<Vi<Vs- 
-I-I--F-F- 

Notice  that  V\  <  V<  Vs  and  V$<  V<  V\  for  all  service  policies  V  z  S.  which  implies  that  V\  and 
determine  bounds  on  the  achievable  performance  by  any  service  policy  in  S. 

The  performance  of  some  service  policies  in  5  is  evaluated  in  terms  of  the  induced  packet  loss  probability 
at  the  finite  queue.  Lp,  and  the  average  packet  delays  D[  and  DF .  These  can  be  easily  determined  from 
the  joint  probability  distribution  of  the  queue  occupancies.  Let  A.F  (A.1)  denote  the  packet  arrival  rate  to  QF 
(Q1).  Then 

f  F  _  +  A1  —  p' 

af 

where  p  —  1  —  ^(0. 0)  =  I  —  Prjp1  =  0.  qF  =  0)  is  the  system  utilization.  The  average  packet  delays  D1 
and  Df  are  derived  from  the  mean  queue  occupancies  E[q{\  and  E[qF\  by  applying  Little’s  theorem: 

d1  =  0"=  «s!i  . 

A1  af(I  -  Lf) 

Some  numerical  results  for  LF  and  D1  induced  by  service  policies  V\  -  Vs  in  Example  3  have  been 
derived  and  are  shown  in  Figs.  7  and  8  under  Bernoulli  and  in  Figs.  9  and  10  under  truncated  Poisson  packet 
arrival  processes  to  each  of  the  queues.  The  finite  queue  capacity  is  N  =  29.  For  P4.  two  different  threshold 
values  are  considered.  7=10  and  7=15.  For  each  type  of  arrival  processes  the  mean  packet  delay  D1 
and  the  packet  loss  probability  LF  are  plotted  versus:  (i)  the  arrival  rate  A.1  for  a  fixed  arrival  rate  AF  =  0.5 
(packets/slot)  and  a  range  of  offered  traffic  load  between  0.8  and  0.925  (Figs.  7  and  9)  and  (ii)  the  arrival 
rate  A.F  fora  fixed  arrival  rate  A.1  =  0.5  (packets/slot)  and  the  same  total  load  range  as  in  (i)  (Figs.  8  and  10). 

The  results  shown  in  Figs.  7-10  are  in  accordance  with  the  performance  ordering  established  by  Propo¬ 
sition  2.  Furthermore,  they  illustrate  the  potential  for  improved  performance  provided  by  the  ad  hoc  service 
policies  in  5,  such  as  Vi.  For  instance,  if  upper  bounds  on  LF  equal  to  10-i:  (under  Bernoulli  arrivals)  or 
10“9  (under  truncated  Poisson  arrivals)  are  necessary  to  meet  quality  of  service  requirements,  then  only 
service  policies  V\ .  Vi  and  74  (7=10)  are  acceptable  for  all  values  of  A.1  and  A.F  considered  in  Figs.  7-10. 
Among  these  policies.  P4  (7  =  10)  induces  significantly  lower  D1  compared  to  that  under  V\  and  Vi  and. 
thus,  it  is  the  most  effective  in  the  sense  that  it  minimizes  D1  while  satisfying  the  constraint  on  LF . 

Results  for  finite  capacity  N  =  50  and  Poisson  packet  arrival  processes  are  shown  in  Figs.  1 1  and  12. 
For  policy  V4.  the  thresholds  7  =  15  and  7  =  25  have  been  considered.  Results  for  N  =  50  and  Bernoulli 
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p,o.  7.  Mean  packet  delay  Dl  and  loss  probability  if  under  Bernoulli  packet  arrivals  (M  =  29.  f  =  0.5).  Curve  (/) 
corresponds  to  policy  V,  \  (4a)  and  (4b)  correspond  to  V*  with  7  =  10  and  7=15.  respectively.  (/)  ^  (j)  indicates  that  the 
results  under  policies  Vi  and  V}  are  approximately  the  same. 


Fig.  8.  Mean  packet  delay  Dx  and  loss  probability  L*  under  Bernoulli  packet  arrivals  (N  =  29,  a1  =  0.5).  Curve  (/) 
corresponds  to  policy  V,:  (4a)  and  (4b)  correspond  to  V*  with  7=10  and  7=15.  respectively.  (/)  ^  (j)  indicates  that  the 
results  under  policies  Vi  and  Vj  are  approximately  the  same. 

arrival  processes  have  not  been  presented  since  most  of  policies  V\-V$ — with  the  exception  of  V^  and  Va 
over  certain  ranges  of  traffic  loads — induce  very  low  loss  probability  at  .  In  this  case,  the  accuracy  of 
the  derived  loss  probabilities  depends  on  the  numerical  precision  of  the  machine. 

The  queueing  system  introduced  and  studied  in  this  work  could  model  the  queueing  behavior  of  a 
protocol  which  allocates  a  common  resource  to  two  traffic  classes  with  different  Quality-of-Service  (QoS) 
requirements.  One  class  may  be  viewed  as  a  best  effort  class  [15]  and  the  second  as  having  more  stringent 
QoS  requirements.  A  sufficiently  large  buffer  ( Q} )  can  be  assumed  to  be  available  to  the  best  effort  traffic 
called  also  Available  Bit  Rate  (ABR)— for  which  a  mean  value  of  the  induced  queueing  intensity  (such 
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Fig.  9.  Mean  packet  delay  Dl  and  loss  probability  under  Poisson  packet  arrivals  (.V  =  29,  X?  =  0.5).  Curve  (/)  corresponds 
to  policy  V,:  (4a)  and  (4b)  correspond  to  V+  with  7  =  10  and  7=15.  respectively. 


Fig.  10.  Mean  packet  delay  Dl  and  loss  probability  Lf  under  Poisson  packet  arrivals  (/V  =  29.  X1  =  0.5).  Curve  (/) 
corresponds  to  policy  V,\  (4a)  and  (4b)  correspond  to  Vi  with  7  =  10  and  7=15.  respectively. 

as  delay)  would  be  meaningful.  For  the  traffic  with  more  stringent  QoS  requirements  a  finite  buffer  ( Q F) 
could  be  considered  to  capture*  for  instance,  a  requirement  associated  with  a  maximum  tolerable  delay. 
For  example,  if  a  packet  delay  exceeding  N  (slots)  is  unacceptable,  then  it  is  evident  that  a  buffer  capacity 
larger  than  /V  is  meaningless.  Furthermore,  the  cell  loss  probability  would  serve  as  a  lower  bound  on  the 
probability  that  a  packet  is  delayed  beyond  N  slots.  Under  the  HoL  priority  policy  for  £)F,  the  packet  loss 
probability  and  the  probability  that  a  packet  is  delayed  by  more  than  N  slots  will  coincide,  assuming  that 
such  packets  do  not  receive  service. 

Finally,  the  partial  ordering  of  the  proposed  policies  can  provide  for  an  efficient  search  for  a  policy  which 
delivers  a  desired  performance.  At  first,  the  Head-of-Line  (HoL)  priority  for  one  class  would  establish  if 
there  exists  any  policy  in  the  class  which  would  induce  the  target-performance  (or  desired  Quality-of- 


30 


I.  Stavrakakis.  5.  Tsakiridou/ Performance  Evaluation  29  (1997)  15-33 


Fig.  11.  Mean  packet  delay  Dl  and  loss  probability  Lf  under  Poisson  packet  arrivals  (.V  =  50.  =  0.5).  Curve  (i ) 

corresponds  to  policy  V, ;  (4a)  and  (4b)  correspond  to  V*  with  T  =  15  and  T  =  25.  respectively. 


Fig.  12.  Mean  packet  delay  D1  and  loss  probability  Z.F  under  Poisson  packet  arrivals  (/V  =  50.  =  0.5).  Curve  (/) 

corresponds  to  policy  V, :  (4a)  and  (4b)  correspond  to  Pa  with  7=15  and  7  =  25.  respectively. 


Service)  for  this  class.  If  it  exists,  then  Proposition  2  could  provide  guidance  in  the  search  for  an  acceptable 
policy.  It  may  not  identify  the  best  policy  or  even  find  an  existing  acceptable  policy,  but  it  is  one — possibly 
effective — alternative  to  the  exhaustive  search.  By  considering  different  starting  policies,  the  entire  space 
could  be  searched  in  a  non-exhaustive  manner  by  moving  'toward*  the  better  policies  as  indicated  by 
Proposition  2. 


6.  Conclusions 

The  main  contribution  of  this  paper  is  the  introduction  and  study  of  the  versatile  class  of  service  policies  5 
based  on  the  concept  of  the  state-policy.  Through  the  construction  of  service  policies  in  terms  of  policy-lines 
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on  the  space  of  state-policies  SV  (Fig.  2).  the  flexibility  in  designing  policies  in  S  is  illustrated.  Through 
the  introduced  simple  performance  ordering  of  service  policies  a  direction  for  the  design  of  service  policies 
in  S  whose  performance  approach  a  desired  level  is  presented.  This  indicates  the  policy  design  flexibility 
and  potential  for  closely  achieving  a  desired  level  of  performance  associated  with  the  introduced  class  of 
service  policies.  The  latter  have  been  illustrated  through  some  examples  and  numerical  results. 


Appendix  A.  Proof  of  Propositions  2  and  3 

A.  1.  Proof  of  Proposition  2 


It  suffices  to  show  that  under  any  realization  of  the  packet  arrival  process 
<7™  <<?[•*  (or  ^<^1 


for  all  k  >  0.  Consider  an  initially  empty  system.  Let  mark  the  (end  of  the  time)  slot  at  which  the  first 
packet  arrives  to  Q[\  clearly. 


F.r? 

Qr\ 


- 

-  Qt{  * 


since  uninterrupted  service  to  QF  has  been  provided  under  both  policies  up  to /| .  Condition  (16)  implies  that 

the  service  horizons  under  policies  Z  and  Q—  denoted  by  r £{q*'Z)  and  rn(r//j‘f?),  respectively— will 
satisfy 


>. 


and  thus. 


Q  / 


i.r 


i.a 


<7 , 


l\  <  /  < 


l?. 


since  a  packet  from  Q 1  is  served  under  policy  Z  no  latter  than  under  policy  f?;  tp  =  t\  +  r^(c/F  ^)  -f  I 

marks  the  slot  at  which  the  server  serves  the  first  packet  in  Ql  under  policy  Q .  At  tp  the  following  cases 
may  be  distinguished. 

(a)  If  q{'P  =  0,  then  it  is  easy  to  see  that  q['?  =  0,  since  the  former  implies  that  only  one  packet  arrival 
to  Ql  has  occurred  up  to  tp .  Let  /[  mark  the  arrival  time  of  the  next  packet  to  Q1.  Then, 


i.r 


I.r?  r\ 
Qi  —  Qi  —  0 


t?< 


t  <  /. 


Thus,  it  has  been  proven  that 

w'here  [/* ,  /[ )  denotes  the  time  interval  of  an  I-cycle,  defined  to  be  the  interval  between  consecutive 
arrivals  to  Q 1  which  find  the  buffer  empty  under  both  policies  Z  and  Q. 
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(b)  If  qX'£  >  0.  then  it  is  easy  to  see  that  the  time  to  at  which  the  second  packet  is  forwarded  to  the  head 
t^‘ 

of  Ql  under  policy  Z  will  satisfy 

t\  -f  zE(qf;£)  +  1  =  tf  <  ‘0  <  tP. 

The  departure  time  instant  of  the  second  packet  under  policies  Z  and  £?.  will  be 
tf  =  t0  +  zE(qf0E)  +  1  and  tp  =  tP  +  t^(q^)  +  1. 
respectively.  Since  in  view  of  (16). 

rE(q ,„•*)  <  r *VA 

*  i 


it  is  implied  that  tp  <  tP  and 


l.r  .  l.r? 

q,  <  <],  ■ 


.n 


(A.  1 ) 


(A. 2) 


'0  <  t  <  '  }  ■ 

By  reiterating  the  argument  presented  above  until  Ql  becomes  empty  under  both  policies,  it  is  estab¬ 
lished  that  (A. 2)  holds  at  any  time  instant  of  an  l-cvcie  with  an  arbitrary  number  of  packet  arrivals  to 

<2'- 

Parts  (a)  and  (b)  above  complete  the  proof  of  Proposition  2. 


A.2.  Proof  of  Proposition  3 


The  proof  can  be  established  by  following  the  proof  of  Proposition  2.  The  introductory  part  and  part  (a)  of 
that  proof  is  directly  applicable.  In  order  for  part  (b)  to  be  valid,  the  validity  of  (A.  1 )  needs  to  be  established. 
While  condition  (16)  in  Proposition  2  guarantees  that  for  any  values  of  q^1  and  inequality  (A.  I )  will 
hold  true,  condition  (20)  in  Proposition  3  would  also  guarantee  it  provided  that 


F.r  .  F.f? 

9/„  ^  • 


(A. 3) 


That  is,  provided  that  the  finite  queue  occupancy  when  the  second  packet  is  forwarded  to  the  head  ot  (21 
under  policy  Z  be  not  less  than  under  policy  Q.  Since  Q F  will  have  been  reduced  by  tp  —  to  at  tp  under 
policy  n  compared  to  its  value  at  to  under  policy  Z  and  the  potential  new  packet  arrivals  to  (?F  over  the 
interval  (t0.  tp)  will  be  at  most  tp  -  r0  (condition  (19)),  it  is  evident  that  (A.3)  will  hold  true.  Reiterating 
the  argument  for  the  packet  arrival  to  the  head  of  Ql  that  may  follow,  the  proof  of  Proposition  3  can  be 
established. 
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Abstract 

This  paper  presents  a  flexible  traffic  model  which  is  capable  of  describing  traffic  dynamics  on  a  variety  of  time-scales 
associated  with  broadband  packet  networks  such  as  ATM.  The  model  is  applied  to  the  study  of  Non-Deterministic  Periodic 
(NDP)  sources  as  well  as  Variable  Bit  Rate  (VBR)  video  sources  in  which  cells  are  delivered  at  a  sub-frame  level  referred  to 
as  a  slice.  An  efficient  numerical  technique  is  presented  for  the  study  of  a  finite-capacity  multiplexer  under  an  input  process 
determined  by  the  superposition  of  /V  of  the  proposed  traffic  models.  Numerical  results  support  the  assertion  that  the 
time-scales  at  which  variations  in  a  cell  arrival  stream  occur  have  a  significant  impact  on  multiplexer  performance,  thereby 
illustrating  the  relevance  of  the  proposed  modeling  and  analytical  techniques. 

Keywords:  Time  scales:  Traffic  modeling;  Queueing  analysis:  Periodic  sources;  ATM 


1.  Introduction 

The  desire  to  accommodate  a  diverse  mixture  of  user  traffic  in  a  Broadband  Integrated  Services  Digital 
Network  (B-ISDN)  has  led  to  an  intense  concentration  of  research  efforts  on  technologies  which  allow  for 
traffic  multiplexing,  such  as  the  Asynchronous  Transfer  Mode  (ATM).  Performance  evaluation  of  statistical 
multiplexers  under  a  variety  of  input  processes  and/or  service  policies  (see  [1-6)  and  the  references  therein)  has 
been  essential  in  determining  the  achievable  effectiveness  of  statistical  multiplexing. 

The  development  of  efficient  congestion  control  and  call  admission  procedures  is  an  area  in  which 
multiplexing  analysis  and  network  traffic  characterization  continue  to  play  an  important  role.  Consequently,  the 
formulationof  accurate  traffic  models  which  lend  themselves  to  analytical  studies  is  a  topic  of  ongoing  research. 
In  a  high-speed  integrated  services  networking  environment,  accurate  traffic  models  should  have  the  ability  to 
model  variations  in  the  input  cell  arrival  process  on  a  variety  of  different  time-scales.  These  time-scales  may 
represent  time-intervals  (in  network  time  slots)  over  which  logical  information  blocks  arc  delivered  to  the 
network. 
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For  instance,  consider  a  Variable  Bit  Rate  (VBR)  video  coder  [7-9].  Bits  generated  over  a  frame  form  a 
logical  block  of  information  which  could  provide  a  basis  for  the  description  of  die  generated  traffic  as  well  as 
the  desirable  QoS.  Notice  that  if  the  VBR  video  coder  pre-buffers  and  uniformly  distributes  cells  in  each  frame, 
then  the  cell  activity  over  a  frame  can  be  described  in  terms  of  a  Fixed  cell-arrival  rate  [9],  and  a  relevant 
time-scale  may  be  that  of  a  video  frame.  If  pre-buffering  at  the  frame  level  is  not  feasible  (or  desirable),  due  to 
the  induced  frame  delay,  pre-buffering  and  uniform  spreading  over  a  shorter  time-scale  (referred  to  as  a  slice) 
may  be  necessary.  In  this  case,  the  cell  arrival  rate  over  a  slice  will  be  constant  and  changes  in  the  cell  arrival 
rate  process  will  occur  within  the  frame  at  slice  boundaries.  Note  that  the  absence  of  frame-level  pre-buffering 
has  been  shown  [10-13]  to  give  rise  to  cell  arrival  autocorrelation  functions  which  are  pseudo-periodic,  as 
opposed  to  the  monotonic  autocorrelation  functions  arising  from  frame-level  pre-buffering  schemes  [7]. 
Packetized  voice  provides  yet  another  example  of  a  common  traffic  source  time-scale  when  one  considers  the 
duration  of  time  required  to  generate  enough  bits  to  form  a  cell.  The  resulting  time-scale  is  then  representative 
of  the  minimum  cell  inter-arrival  time  associated  with  the  source. 

When  traffic  sources  such  as  those  mentioned  above  are  multiplexed  onto  a  network  link  of  capacity  C.  the 
time-scale  7  associated  with  each  source  can  be  seen  as  the  one  at  which  the  network  perceives  events  of 
interest  (i.e.  frame,  slice,  cell  arrivals)  to  occur,  and  can  be  easily  expressed  in  units  of  slots.  For  instance,  when 
a  voice  source  generating  information  at  a  peak  rate  of  r  bits/sec  is  multiplexed  onto  an  ATM  link  with 
capacity  C  bits/sec,  a  relevant  time-scale  for  the  source  is  given  by  the  ratio  C/r. 

In  this  paper,  a  stochastic  traffic  model  is  presented  which  is  capable  of  capturing  the  cell  arrival  dynamics 
associated  with  a  wide  range  of  deterministic  time-scales.  The  model  is  based  on  an  underlying  ///-state  periodic 
Markov  chain  in  which  state  transitions  occur  every  Ty  **  1  time  slots.  While  in  a  given  state,  cells  are  generated 
independently  from  slot  to  slot  according  to  an  arbitrary  distribution.  Two  time-scales  of  length  7]  and  mTs  may 
be  easily  identified  in  this  traffic  model.  A  computationally  tractable  performance  analysis  of  a  finite-capacity 
multiplexer  is  developed  under  input  traffic  determined  by  the  superposition  of  N  such  traffic  sources.  The 
technique  involves  the  decomposition  of  a  potentially  large  irreducible  Markov  chain  into  smaller  irreducible 
Markov  chains  whose  dimensionality  is  independent  of  m  and  7,.  Since  the  time-scales  Tx  and  m7{ ‘  are  expected 
to  increase  as  the  network  speed  increases,  this  decomposition  is  central  to  the  tractability  of  the  analysis 
approach  in  a  high-speed  networking  environment.  The  model  is  applied  to  the  study  of  Non-Deterministic 
Periodic  (NDP)  cell  streams  as  well  as  to  VBR  video  which  is  multiplexed  at  slice  level. 

In  the  following  section,  the  proposed  traffic  model  is  described  in  detail  in  terms  of  a  generalized  periodic 
Markov  chain  and  the  finite  capacity  multiplexer  is  analyzed  in  discrete  time.  Subsequent  sections  will  present 
the  applications  mentioned  above,  as  well  as  a  discussion  of  relevant  past  work. 


2.  Source  characterization  and  queueing  analysis 

In  the  next  subsection  the  proposed  traffic  model  is  described  in  terms  of  a  generalized  periodic  Markovian 
source  and  the  generated  traffic  is  characterized.  The  resulting  sub-period-level  autocorrelation  function  for  a 
source  is  derived  in  Section  2.2.  In  Section  2.3,  the  superposition  of  N  such  traffic  sources  is  considered,  and 
the  aggregate  traffic  is  described.  Finally,  a  finite-capacity  multiplexer  fed  by  N  generalized  periodic  Markovian 
sources  is  studied  in  the  last  subsection.  The  fundamental  time  unit  is  taken  to  be  the  transmission  time  of  the 
fixed  size  cell,  or  the  duration  of  a  time  slot  on  the  output  link  as  denoted  by  A  (in  seconds). 

2.1 .  Description  of  the  proposed  traffic  model 

Let  {///*,  7/,  /*(■)]  I  <  m*}  denote  a  set  of  parameters  associated  with  the  k th  traffic  source,  1  <  k  <  N, 
where  N  denotes  the  total  number  of  traffic  sources  to  be  multiplexed  and  the  parameters  arc  defined  as  follows: 
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[—  time  (in  suthperiods) 
rum 


— | - 1 - - 1 —  time  (in  slots) 

(n+1)Ts  (ru2)Tt 


Fig.  1.  The  relevant  time-scales  associated  with  the  GPNt  source. 


The  number  of  states  in  an  underlying  periodic  Markov  chain  associated  with  the  k th  source.  Let  i ir‘ 
denote  the  stationary  probability  that  this  periodic  chain  is  in  state  and  p\y  denote  the  /i-step 
transition  probability  from  state  i  to  state  j. 

Tk:  An  integer  constant  (called  sub-period)  which  represents  the  sojourn  time  (in  slots)  for  each  stale  in' the 

periodic  Markov  chain.  In  other  words,  a  transition  occurs  every  7/  time  slots  (or  7j*J  seconds),  and  the 
kth  source  has  a  period  of  length  7‘  =  mi*7,‘  slots. 

/*(.)  ;  The  probability  mass  function  (PMF)  for  the  number  of  cells  generated  in  each  slot  by  the  kth  source 
when  that  source  is  in  state  j.  1  <  nik.  Akj  will  denote  the  mean  number  of  cell  arrivals  per  slot  for 

source  k  in  state  j,  and  Nk  will  denote  die  maximum  number  of  such  arrivals.  A//  takes  values  from  2°, 
where  Z°  denotes  the  set  of  non-negative  integers. 

A  traffic  source  described  in  terms  of  {m*.  7,*.  //(Oil  »«*}  will  be  referred  to  here  as  a  Generalized 

Periodic  Markov  (GPM)  source.  It  may  easily  be  established  that  die  mean  arrival  rate  for  source  k  is  given  by 


I 


A‘  =  —  L  A*. 

7-1 


(0 


nr 


For  the  remainder  of  the  paper,  it  will  be  assumed  that  all  sources  have  the  same  sub-periods;  hence,  Tk  will 
be  denoted  simply  by  T%,  In  addition  to  the  parameters  defined  above,  let  ck  (referred  to  as  the  initialization 
lime)  denote  the  time  instant,  defined  at  the  beginning  of  a  sub-period,  at  which  the  A*th  source  was  activated 
(from  state  I),  assuming  that  1  <  ck  <  nik . 

Let  A*  denote  the  number  of  cells  arrived  over  sub-period  n.  n>  1  (interval  [( /i  -  1)7,,  nTt))\  yk  will 
denote  the  mean  arrival  rate  over  this  interval.  Arrivals  are  assumed  to  be  declared  at  the  end  of  the  time  slot 
over  which  they  occur  and  the  mean  arrival  rate  over  the  nth  sub-period  is  given  by 


yk  =  “  A*  where  j  =  n  mod  ( mk  +  1 )  -  ck  +  1 ,  (2) 

and  n  is  assumed  to  be  greater  than  or  equal  to  c*.  Clearly,  if  n  <  c*.  then  the  source  has  not  yet  been  activated, 
and  yk  =  0. 

Note  that  the  proposed  source  is  capable  of  modeling  the  cell  arrival  dynamics  of  integrated  services  traffic 
sources  at  lime-scales  of  one,  Tk  (sub-period)  and  Tk  (period)  slots  (sec  Fig.  1).  Since  T%  can  be  greater  than  or 
equal  to  1,  the  proposed  traffic  model  is  capable  of  capturing  variations  in  the  input  process  on  many  different 
time-scales.  When  T%  =  1,  for  instance,  the  proposed  source  model  reduces  to  the  periodic  Markov  chain  model 
considered  in  [14],  where  the  first  and  second  moments  of  the  queue  length  process  arc  determined  for  an 
infinite-capacity  queue  in  discrete  time. 


2.2.  The  sub-pcriod-lcucl  autocorrelation  function 

In  this  section,  the  cell  arrival  autocorrelation  function  for  a  GPM  source  will  be  considered.  In  order  to 
simplify  notation,  the  source  index  k  will  be  dropped  for  all  quantities  associated  with  the  GPM  model.  The 
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normalized  autocorrelation  function  /?(n),  sometimes  referred  to  as  the  correlation  coefficient  at  lag  /i.  where  n 
is  indexed  at  sub-period  boundaries,  is  defined  as 


R(n)± 


eM„- 

E{4}-EjMJ 


(3) 


This  function  is  derived  in  Appendix  A  for  the  case  of  Bernoulli  arrival  processes  within  each  sub-period, 
and  is  given  by  the  expression 
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m  ,-i  i 

|  -  A2 
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where  j  =  (/  +  n  —  1)  mod(w)  +  1,  and  8n  is  the  Kronecker  delta. 

The  autocorrelation  function  for  cells/sub-period  will  not  be  used  to  characterize  the  GPM  sources  until 
Section  3.2,  where  the  arrival  PMF  /;(«)  for  each  sub-period  is  assumed  to  be  Bernoulli.  For  arrival  processes 
other  than  Bernoulli,  the  expression  for  £2  given  in  (25)  will  be  different  and  Eq.  (4)  will  require  slight 
modifications. 


2.J.  The  superposition  of  N  GPM  traffic  sources 


Before  considering  a  statistical  multiplexer  fed  by  a  number  of  the  proposed  GPM  sources,  the  aggregate 
input  traffic,  which  is  given  by  the  superposition  of  the  individual  sources,  will  be  characterized.  A  useful 
property  of  the  GPM  traffic  model  is  that  it  is  closed  under  superposition.  That  is,  the  superposition  of  /V  GPM 
sources,  of  equal  sub-periods  7S,  is  also  a  GPM  source  with  an  underlying  periodic  Markov  chain  of  some 
period  M.  Note  that  since  the  initialization  lime  ck  is  defined  at  sub-period  boundaries,  it  is  implicitly  assumed 
that  the  sub-period  boundaries  are  synchronized  among  all  N  sources.  The  sources  arc  said  to  be  completely 
synchronized  when  c1  =  •  •  •  =  c‘v. 

It  is  easy  to  show  that  the  period,  M,  of  the  GPM  source  associated  with  the  aggregate  source  is  given  by 
Af=LCM{ml,  (5) 

where  LCM  {*)  denotes  the  least  common  multiple  of  its  argument. 

The  PMF  fj(-)  associated  with  state  j  of  the  aggregate  GPM  source,  1  <  M,  is  easily  evaluated  through 

the  jV-fold  convolution 


The  quantities  ak  are  functions  of  both  j  and  the  initialization  constant  c*,  and  are  given  by 


(j  4-  c*  —  1  if  1  <  c*t 

j  —  ck  +  1  if  ck  ^  j  <  ck  +  mk  t 

j  -  mk  -  ck  +  1  if  ck  +  mk  <  j  <  M . 


(6) 

(7) 


2.4 .  Multiplexer  performance  analysis 

In  this  section  a  First-Come  First-Sen/ed  (FCFS)  multiplexer  fed  by  N  GPM  sources  (Fig.  2)  is  analyzed  in 
discrete  time;  its  buffer  capacity  is  assumed  to  be  finite  and  is  denoted  by  B. 
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Fig.  2.  The  finite-capacity  queueing  system. 


It  is  assumed  that  cell  departures  occur  before  new  arrivals  arc  counted  at  the  end  of  each  time  slot.  The 
system  state  is  defined  at  sub-period  boundaries  in  terms  of  the  quantities  {qk,  .rt),  where  </,  represents  the 
queue  length  (multiplexer  occupancy)  at  the  end  of  sub-period  A  (time  instant  AT, ),  and  xt  denotes  the  state  of 
the  aggregate  source  at  that  time.  It  is  easy  to  establish  that  the  stochastic  process  (r/4 .  .r4)t  „  ,  is  a  Markov  chain 
with  state  space  ((/,  y)|0  <  /  <  B,  I  Hj  <  A/),  where  /  ( j)  denotes  the  current  state  of  qk  (.r4).  The  probability 
that  the  Markov  chain  (qk,  jct}4  >  ,  moves  from  state  (i.  j)  at  time  k  (defined  at  sub-period  boundaries)  to  state 
(/’,  /)  at  time  k  +  1  will  be  denoted  by  p(i,  j.  /'.  /).  The  corresponding  transition  matrix  is  sparse  due  to  the 
simple  periodic  structure  of  the  input  process  { .r4 }t  ?  ,  and,  in  fact,  a  fraction  of  (A/  —  1 )/ Af  of  the  total  number 
of  elements  are  zero. 

By  mapping  p(i,  j.  i".  /).  for  0  <  i,  i'  <  B  and  1  /  <  Af.  into  pU.  I')  tlirough  the  transformation. 

l=(j-  lXB  +  1)  +  /  and  /'  =  (/  -  lX5  +  1)  +  /',  the  resulting  transition  matrix,  P,  with  elements  p(l,  I'), 
takes  the  canonical  form  of  an  irreducible  periodic  Markov  chain  of  period  A /  [15],  given  by 
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The  following  proposition  allows  for  the  easy  determination  of  the  square  matrices  P;.  1  Af.  which  are 
of  order  (B  +  1). 


Proposition  1.  Given  some  initial  value  for  qk  at  the  beginning  of  the  (k  +  l)rr  sub-period  ( system  state  is 
(qk  —  i,  xk  —  j)),  the  finite  queue  will  evolve  exactly  as  an  Geo/D/ 1/B  queue,  with  i.i.d.  batch  arrivals  given 
byfj,  over  the  ( k  +  l)rr  sub-period  ( interval  [AT,,  AT,  +  7,]).  □ 


In  view  of  Proposition  1.  it  is  easy  to  sec  that  the  matrices  P )  are  simply  the  7',-step  transition  probability 
matrices  for  a  finite  queue,  under  the  i.i.d.  batch  arrival  process  specified  by  /{■),  where  1  <y<Af.  The 
one-step  transition  probabilities  pj(i,  /’),  0  <  i,  i*  <  B,  are  given  below. 

For  0  <  i  <  B  and  max(0,  f  —  I )  <  i  <  B  -  1 , 


Pj(  i.n  =/■(«'-/+!)  +/»•!„*,- -0)- 


where  1(.(  denotes  the  indicator  function. 
For  0  <  i  <  B  and  i'  =  B, 


(9) 


p,o\  n=  i  />(o.  o°) 

I 

where  A'-  denotes  the  maximum  number  of  arrivals,  as  determined  by  (6),  when  the  source  is  in  state  j.  The 
elements  of  P  ,  denoted  p)T,)(i,  /').  can  now  be  calculated  by  using  the  recursion 

p^v'n-LpY'-'K^n-pjU-n-  (") 
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It  is  important  to  note  that  the  dimensionality  of  the  transition  matrix  P.  which  is  M{B  +  \)XM(B  +  1), 
prohibits  the  solution  of  systems  with  input  processes  having  large  periods,  and  consequently,  large  values  of 
A/.  The  following  theorem,  however,  provides  for  the  exact  stationary  solution  of  {</*.  .t4}4>  ,  by  solving  M 
Markov  chains  with  transition  matrices  of  order  (5  4-  1)  and  thus,  significantly  reduces  the  numerical 
complexity  involved  in  calculating  the  stationary  probability  vector  of  the  transition  matrix  in  (8),  Its  proof  can 
be  found  in  [16]. 


Theorem  2.  The  finite  irreducible  periodic  Markov  chain ,  with  transition  matrix  given  by  (8),  has  a  unique 
stationary  probability  vector ,  77=  (7r0J  •  *  *  ttb  w)\  given  by 


77  ~ 


1 


77  , 
77  •> 


(12) 


where  77.  is  the  unique  stationary  probability  vector  for  A-  =  P,  •  •  PAf  •  P,  *  •  ■  P,  _  ,  0  =  2, - M ),  and 

A  j  =  Pj  ■PAf.  □ 


Upon  solving  for  77  =(770I  ttb  Kf)f  using  Theorem  2,  the  queue  length  probability  distribution  at 
sub-period  boundaries  is  easily  evaluated  as 

M 

q(k)=  lim  Pr{ q„  =  k)  =  E  irk  y.  k  =  0 . B.  (13) 

"-x  i- 1 

Let  irk  ■  denote  the  stationary  probability  that  the  system  is  in  state  (k,  j)  at  an  arbitrary  time  instant  (slot 
boundary).  These  quantities  are  derived  in  Appendix  B  through  the  use  of  a  rencwal/regenerative  theory-based 
approach.  The  queue  length  distribution  q(k)  at  arbitrary  time  instants  can  now  be  computed  as  in  (13).  Note 
that  when  f  =  1  (sub-period  is  equivalent  to  the  slot),  the  queue  length  distribution  at  arbitrary  times  is  simply 
q(k)  —  q(k)  for  0  <  k  <  B. 

The  cell-loss  probability  L  is  also  derived  in  Appendix  B  using  the  same  approach  followed  for  the 
derivation  of  irk  and  is  given  by 

L  =  Pr{ arriving  cell  is  dropped} 

E  0/>(5  + 1  +  ('4) 

r- 0  /-I  n  -  I  i-/, 

where  /,  =  max(0,  B  +  2- N),  K{  =  max(0,  i  -  B  -  1),  K2  =  N  -  B  -  1  + 1  and  A  denotes  the  total  offered 
load,  which  is  given  by 

A=  E  A‘.  (15) 

k-  1 


3.  Application  1:  Non-Deterministic  Periodic  (NDP)  sources 

An  important  characteristic  of  high-speed  packet  networks  such  as  ATM,  which  is  not  Considered  in  the 
relevant  performance  studies  of  [1-6]  is  the  potentially  large  disparity  in  the  relative  speeds  of  an  output  link 
and  an  input  source.  This  disparity,  or  speed-up  factor  -  commonly  appearing  at  the  network  edges  where  a 
number  of  sources  are  being  multiplexed  onto  an  output  link  -  results  in  successive  cells  from  an  active  source 
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being  spaced  by  the  interval  of  time  required  to  generate  enough  bits  to  form  a  cell.  The  speed-up  factor 
mk  =  C/rk,  where  C  is  the  link  speed  (in  slots/sec)  and  rk  is  the  source  speed  is  also  relevant  in 
inter-networking,  where  a  number  of  slower  packet  networks  of  various  speeds  are  connected  to  the  B-ISDN 
through  a  common  gateway.  In  addition,  it  has  been  shown  in  [17]  that,  in  a  networking  environment  where 
input  and  output  lines  are  of  the  same  speed,  source  periodicities  like  those  mentioned  above  give  rise  to  cell 
arrival  streams  within  the  network  which  also  exhibit  significant  periodicities.  This  observation  has  been  used  in 
[18]  to  justify  the  adoption  of  periodic  input  cell  arrival  streams  for  the  simulation  study  of  an  ATM  switch. 

One  way  to  study  the  speed-up  factor  analytically  is  by  considering  the  multiplexing  of  deterministic  periodic 
sources  [19,20],  which  implies  that  all  of  the  input  sources  are  Constant  Bit  Rate  (CBR).  A  more  realistic  traffic 
model  for  high-speed  packet  networks  such  as  ATM  would  be  a  Non-Deterministic  Periodic  (NDP)  model  in 
which  cells  are  generated  every  mk  slots  according  to  a  stochastic  process,  thereby  allowing  for  a  potential 
statistical  multiplexing  gain. 

A  relevant  question  that  arises  when  considering  the  multiplexing  of  NDP  sources  is  whether  or  not  the 
aggregation  of  such  sources  can  be  approximated  by  any  of  the  commonly  used  traffic  models.  For  instance,  it  is 
widely  accepted  that  the  superposition  of  a  large  number  of  uncorrelated  cell  arrival  streams  can  be 
approximated  by  a  Poisson  process,  which  allows  for  a  simple  analytical  solution  to  many  queueing  problems. 
However,  this  limiting  assumption  may  not  be  valid  when  the  cell  arrival  processes  have  a  periodic  structure.  In 
fact,  it  will  be  shown  that  for  the  superposition  of  NDP  sources,  the  Poisson  assumption  can  be  either 
pessimistic  or  optimistic,  depending  upon  the  operating  conditions. 

A  correlated  traffic  model  in  which  cells  are  generated  at  a  minimum  spacing  of  sk  slots  is  considered  in  [21] 
and  [22],  where  the  cell  anival  process  is  governed  by  an  underlying  2-statc  Markov  chain.  The  speed-up  factor 
sk.  which  is  equal  to  the  unit  step  transition  time  of  the  Markov  chain  for  source  k,  is  shown  to  have  a 
significant  impact  on  multiplexer  performance.  Specifically,  as  sk  increases,  the  inter-cell  correlations  present  at 
the  source  become  less  dominant  and  the  multiplexer’s  performance  improves  accordingly.  These  observations 
suggest  that,  for  sufficiently  large  values  of  sk.  it  may  be  possible  to  drop  the  correlated  source  model  in  favor 
of  a  simpler,  uncorrelated  model  without  significantly  impacting  on  the  model’s  accuracy. 


3.1  GPM  traffic  model  for  NDP  sources 


The  proposed  GPM  source  can  be  easily  adopted  to  model  such  an  uncorrelated  source  with  speed-up  factor 
mk  by  setting  T,  equal  to  1  slot;  the  Jfcth  source  is  then  described  by  the  parameters  (m‘,  I.  //(•)).  Note  that  this 
is  equivalent  to  setting  Tk  =  m‘.  Source  k  is  assumed  to  deliver  cells  every  m*  time  slots  according  to  a 
Bernoulli  PMF  with  rate  A*.  This  is  captured  through  the  model  by  setting,  for  j=  c\ 


and  for  j  +  cl. 


for  i=l. 
for  i  =  0. 


(16) 


fk(i)=( 0  for/=1’  (17) 

'  '  \  I  for  i  =  0. 

Note  that  the  analysis  presented  in  Section  2  provides  results  for  a  given  vector  of  source  start  times,  while 
the  analyses  of  [19]  and  [20]  assume  random  starting  times  among  the  sources.  For  the  deterministic  (periodic) 
source  models  assumed  in  [19]  and  [20],  random  starting  times  represent  the  only  stochastic  element  of  the 
resulting  queueing  system,  without  which  the  analysis  would  bo  trivial.  A  more  meaningful  treatment  of  the 
problem,  which  is  adopted  here,  may  be  one  in  which  specific  -  perhaps  worst-case  -  starting  times  can  be 
assumed.  A  stochastic  cell  generation  process  can  then  be  considered  at  the  deterministic  time-scales  »t*.  which 
is  more  representative  of  realistic  traffic  generators  such  as  packctizcd  voice  and  network  uplinks  with  speed 
mismatches. 
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3.2.  Numerical  results  for  NDP  input  processes 

Although  the  analysis  described  in  Section  2.4  allows  for  the  superposition  of  input  sources  having  different 
periods,  the  presentation  and  discussion  of  the  numerical  results  will  be  facilitated  by  considering  sources  with 
identical  periods  denoted  by  m.  Each  source  is  also  assumed  to  deliver  cells  according  to  a  Bernoulli  PMF  with 
rate  a,  which  results  in  a  mean  rate  per  source  of  a/m;  the  total  offered  load  is  then  given  by  A  =  Na/m. 

Fig.  3  illustrates  the  effect  that  the  source  period  has  on  celUoss  probabilities  as  a  function  of  the  buffer 
capacity  B  when  N  =  40  sources  are  multiplexed  at  an  offered  load  of  p  =  0.9  and  the  sources  are  assumed  to 
be  synchronized  (c1  =  •  •  •  =  c40).  Clearly,  the  source  period  nt  has  a  significant  impact  on  queue  performance 
when  the  total  offered  load  is  held  constant.  Note,  however,  that  this  impact  is  not  necessarily  negative  as  has 
been  suggested  in  the  past  [18].  In  fact,  for  “acceptable"  loss  probabilities  on  the  order  of  10"6  and  smaller,  the 
impact  that  increasing  values  of  m  have  on  the  queue  performance  is  positive.  The  curves  for  m  =  10,  20,  and 
30  are  also  compared  with  the  corresponding  M/D/l/B  approximation,  and  the  results  are  quite  enlightening. 
Fig.  3  reveals  that  the  Poisson  assumption  can  either  overestimate  or  underestimate  loss  probabilities  depending 
upon  buffer  capacity  and  the  source  period  m,  which  implies  that  the  solution  to  the  M/D/1  /B  queue  should 
not  be  used  for  bounding  purposes  when  considering  such  periodic  input  streams.  For  a  loss  probability  of  10“9. 
the  M/D/l/B  approximation  is  extremely  pessimistic,  requiring  a  buffer  capacity  of  B  =  90,  compared  to  a 
required  buffer  capacity  of  B  =  50  when  m  =  30.  In  fact,  the  negative  slope  of  the  M/D/1  /B  curve  is  always 
smaller  than  the  cases  for  m  >  1.  which  implies  that,  beyond  some  value  of  B  (which  varies  with  m),  the  loss 
probabilities  for  the  limiting  Poisson  traffic  assumption  upper  bounds  the  actual  loss  probabilities  for  the 
superposition  of  non-deterministic  periodic  sources. 

Fig.  4  demonstrates  the  relationship  between  cell-loss  probabilities  and  offered  load  for  a  case  in  which  the 
Poisson  assumption  is  optimistic  (B  =  30)  as  well  as  a  case  in  which  the  assumption  is  overly  pessimistic 
( B  =  50).  The  N  -  AO  NDP  sources  are  assumed  to  have  speed-up  factors  of  m  -  30  associated  with  them. 
When  B  —  30,  the  superposition  of  NDP  input  sources  results  in  loss  probabilities  larger  than  10"5  for  the  range 
of  loads  between  0.7  and  0.99.  The  corresponding  M/D/ 1/30  queue,  however,  is  overly  optimistic  and  results 
in  loss  probabilities  smaller  than  I0~9  for  loads  as  high  as  0.72.  On  the  other  hand,  when  B  =  50.  the 
M/D/ 1/50  approximation  overestimates  the  loss  probabilities  for  all  loads.  It  is  clear  from  Fig.  4  that  the 
Poisson  assumption  performs  poorly  for  low  to  moderate  loads,  and  even  for  offered  loads  of  0.99  there  is  a 
substantial  difference  between  the  M/D/l/B  queue  performance  and  the  corresponding  case  of  NDP  input 
sources. 


Fig.  3.  Loss  probability  versus  buffer  capacity  B  for  various  m.  The  number  of  sources  are  N  -  40,  and  the  total  load  A  -  0.9. 
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Fig.  4.  Loss  probability  versus  offered  load  for  various  B.  The  number  of  sources  arc  N  -  40.  and  speedup  factors  arc  set  to  m  —  30. 


Note  that  for  the  results  presented  in  Figs.  3  and  4,  it  is  assumed  that  the  NDP  sources  are  synchronized 
(c1  =  •  •  •  -  cs),  which  represents  a  worst-case  scenario  in  terms  of  queue  performance.  Although  it  is  difficult 
to  define  a  meaningful  distribution  of  initialization  times  ck  (l  <  k  ^  M)  for  the  NDP  sources,  it  is  reasonable 
to  expect  that  uniformly  distributing  these  times  will  improve  the  queueing  performance.  This  is  illustrated  in 
Fig.  5.  where  loss  probabilities  are  plotted  versus  the  number  of  multiplexed  sources  with  m  =  20  and  p  =  0.9 
for  synchronized  and  unsynchronized  (initialization  times  uniformly  distributed)  sources.  The  uppermost  curves 
represent  results  for  a  buffer  capacity  of  B  —  20,  and  the  lower  curves  for  B  —  50.  Similarly,  the  upper  star  on 
the  right  hand  vertical  axis  denotes  the  M/D/ 1/20  result  while  die  lower  star  denotes  the  M/D/ 1/50  result. 
For  both  cases  {B  —  70  and  B  =  50),  the  queue  performance,  in  terms  of  cell-loss  probabilities,  approaches  a 
limit  for  large  values  of  N.  However,  even  when  the  NDP  sources  are  unsynchronizcd,  the  limiting  results  are 
significantly  different  than  the  M/D/l/B  results. 

The  results  presented  thus  far  indicate  that  all  of  the  parameters  (£.  m%  N  and  A)  affect  significantly  the 
queue  performance,  and  consequently,  the  validity  of  the  M/D/l/B  approximation.  For  each  set  of  numerical 
results  there  seem  to  be  two  distinct  regions  of  operation;  one  in  which  the  source  periodicities  have  a  negative 
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Fig.  5.  Loss  probability  versus  N  for  various  It.  The  total  offered  load  is  A  “  0.9.  and  speedup  factors  are  set  to  m  -  20. 
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Fig.  6.  LP  //-jsiop  versus  B  for  different  values  of  m.  The  total  offered  load  is  A  **  0.9.  and  <V  —  75  sources  are  multiplexed. 

influence  on  the  queueing  behavior,  relative  to  a  Poisson  input  source,  and  one  in  which  the  periodicities  have  a 
favorable  impact.  These  operating  regions  are  examined  more  closely  in  Fig.  6,  where  the  ratio  of  the  loss 
probability  for  the  Poisson  traffic  source  (Z,P)  to  the  loss  probability  for  the  NDP  sources  (LS0P)  is  plotted 
versus  buffer  capacity  B.  When  the  ratio  LP/LS0P  is  less  than  (greater  than)  1,  the  M/D/l/B  approximation 
underestimates  (overestimates)  the  actual  loss  probability.  In  Fig.  6.  N  =  75  non-deterministic  periodic  sources 
are  multiplexed  at  a  total  offered  load  of  0.9  for  source  periods  of  1,  10  and  50.  When  m  =  1,  the  input  to  the 
multiplexer  consists  of  N  =  75  Bernoulli  sources,  and  the  ratio  LP/LSDP  is  close  to  l  for  all  values  of  Z?.  This 
result  supports  the  assertion  that  the  superposition  of  a  large  number  of  Bernoulli  traffic  streams  can  be 
approximated  by  a  Poisson  process  (asymptotic  behavior  of  the  Binomial  distribution),  regardless  of  the  buffer 
capacity.  However,  as  the  source  periods  increase,  the  two  regions  become  more  distinct.  When  ni  =  10  and 
B  -  10.  LNDP  is  5  times  greater  than  LP  for  the  M/D/1/10  queue,  and  when  B  —  100,  LNDP  is  5  times 
smaller  than  Lp .  For  this  case,  the  M/D/l/B  results  arc  overly  pessimistic  for  B  >  50.  When  ni  =  50,  the  two 
regions  arc  even  more  well-defined,  and  the  Poisson  assumption  does  not  perform  well  in  either  region.  It  is 
interesting  to  note  that,  as  was  the  case  in  Fig.  3,  the  region  of  operating  conditions  which  produces  loss 
probabilities  considered  acceptable  in  ATM  (less  than  10~6)  is  the  one  in  which  the  M/D/I/B  queue  provides 
overly  pessimistic  results  (LP  >  LNDP). 


4.  Application  2:  Multiplexing  VBR  video  at  slice-level 

While  the  modeling  of  VBR  video  sources  has  recently  received  significant  attention,  there  presently  exists 
no  widely  accepted  model  which  lends  itself  to  mathematical  analysis.  Most  existing  models,  like  those  in  [7] 
and  [8],  assume  that  video  is  pre-buffered  at  the  frame  level.  The  purpose  of  pre-buffering  is  simply  to  provide 


Video  Source  smooth  traffic 


Fig.  7.  Pre-buffering  a  VBR  video  source. 
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Fig.  8.  Slice-level  sample  of  a  VBR  video  sequence. 


the  network  with  a  cell  stream  which  is  as  smooth  as  possible;  this  can  be  accomplished  by  simply  storing  data 
temporarily  in  a  pre-buffer  as  in  Fig.  7. 

In  [l  l],  a  flexible  simulation  model  is  developed  which  captures  die  behavior  of  VBR  video  traffic  at  both 
the  frame  and  slice  level.  There  are  30  equal-sized  slices  in  every  frame,  and  the  inter-frame  as  well  as  die 
inter-slice  autocorrelation  functions  are  matched  to  those  from  experimental  data  provided  by  [12].  The  cell 
generation  rate  is  considered  to  be  constant  throughout  the  slice,  modeling  die  VBR  video  traffic  scheme 
delivered  to  the  multiplexer  under  a  slice-level  pre-buffering  scheme.  Through  a  simulation  study  driven  by  a 
number  of  source  models  which  match  the  characteristics  of  a  full-motion  video  movie.  [12],  the  audiors  show 
that  the  impact  of  inter-frame  correlation  is  negligible  while  the  impact  of  inter-slice  correlation  is  significant. 

These  simulation  results  suggest  that  if  pre-buffering  at  the  frame  level  is  not  practical,  and  transmission 
takes  place  at  the  slice  level,  then  an  effective  traffic  model  should  capture  the  time-scales  associated  with  both 
the  frame  and  the  slice.  The  former  will  determine  the  time-scale  at  which  inter-frame  correlations  exist,  while 
the  latter  will  determine  the  horizon  of  the  constant-rate  cell  delivery  to  the  multiplexer  (slice).  Note  that  under 
pre-buffering  at  the  frame  level  a  single  rate  of  the  resulting  traffic  over  a  frame  is  generated.  Under 
pre-buffering  at  the  slice  level,  and  for  a  selected  average  cell  rate  over  a  frame,  30  potentially  different  and 
correlated  rates  of  the  resulting  traffic  over  a  frame  are  produced;  each  rate  remains  fixed  over  a  slice. 

Due  to  the  high  positive  autocorrelations  in  the  number  of  cells  generated  in  a  frame,  the  cell  arrival  process 
(at  slice  level)  over  a  frame  can  be  considered  to  remain  virtually  unchanged  over  a  sequence  of  consecutive 
frame  horizons.  This  is  in  agreement  with  the  experimental  data  from  [12]  (Fig.  8)  in  which  the  slice-level 
arrival  rate  is  shown  to  have  a  periodic  behavior  (with  period  30  slices),  and  the  frame-level  arrival  process 
remains  relatively  constant  over  many  frames  (or  periods).  Assuming  that  the  time  horizon  associated  with 
changes  in  the  frame-level  arrival  rate  is  sufficiently  long  to  induce  the  steady-state  behavior  of  the  multiplexer, 
a  quasi-static  approximation  similar  to  that  in  [9]  can  be  used  to  study  the  multiplexing  of  VBR  traffic  which  is 
pre-buffered  at  the  slice  level  as  described  above.  However,  a  constant-rate  arrival  process  over  a  frame,  such  as 
the  Poisson  process  considered  in  [9],  will  clearly  not  be  applicable  here  to  approximate  the  arrival  process  over 
a  number  of  video  frames.  Instead,  assuming  a  frame-level  arrival  rate  which  remains  constant  over  many 
frames,  a  periodic  cell  arrival  process  (with  a  period  equal  to  the  frame  length)  within  each  frame  could  be 
considered. 

4.1 .  GPM  traffic  model  for  the  periodic  slice-level  arrival  process 

The  periodic  cell  arrival  process  described  above  can  be  easily  captured  through  the  proposed  GPM  traffic 
model  by  setting  the  period  of  each  GPM  source  ( mk )  to  be  30  and  the  sub-period  (Ts)  equal  to  the  number  of 


304 


R.  Latulry.  /.  Stavrakakis  /  Computer  Networks  and  ISDN  Systems  29  (1997)  293-313 


Fig.  9.  Sub* period -level  autocorrelation  function  for  A*  -  0.2.  b  *■  0.2  and  various  values  of  Tt. 


slots  in  a  slice;  Ts  varies  with  the  link  speed,  the  frame  rate  (in  frames/sec)  and  the  slot  capacity.  Assuming  a 
frame  rate  of  24  per  second,  as  in  (l  l]  and  [12],  and  a  slot  capacity  of  53  bytes  (48  bytes  of  payload  and  5  bytes 
of  header)  for  ATM, 

C  bits  1  byte  I  slot  1  frame  1  sec 

T  *= - - - -  ■  - - - - 

1  sec  8  bits  53  bytes  30  slices  24  frames 

C 

— -  slots/slice,  (18) 

305,280  '  K  * 

where  C  is  the  link  capacity  in  units  of  bits/sec.  To  capture  the  uniform  distribution  of  cells  within  a  slice,  the 
PMF  //(*)  for  the  number  of  cells  generated  in  each  slot  when  source  k  is  in  state  j  (1  30)  will  be 

Bernoulli  with  rate  A*.  Note  that  this  can  be  seen  as  statistically  smoothing  the  source  at  the  slice  level.  It  is 
believed  that  an  assumption  of  statistical  smoothing  should  provide  for  a  conservative  (or  pessimistic) 
estimation  of  delivered  QoS  if  the  VBR  source  were  deterministically  smoothed  (cells  are  delivered  to  the 
network  equi-spaced  within  each  slice).  This  has  been  shown  to  be  the  case  in  [23]  when  the  VBR  source  is 
pre-buffered  and  smoothed  on  a  frame  by  frame  basis. 

As  was  the  case  with  the  slice-level  arrival  rate  process,  the  slice-level  autocorrelation  function  for  a  VBR 
video  source  has  been  shown  [10,12,13]  to  possess  a  periodic  structure  with  a  period  of  30  slices.  The  GPM 
traffic  model  also  has  a  periodic  sub-period-level  autocorrelation  function  Rk(n)t  as  given  in  Eq.  (4).  In  fact,  for 
a  given  frame-level  cell  arrival  rate  for  source  k ,  denoted  by  A*  in  Eq.  (1).  the  slice-level  autocorrelation 
function  for  the  k\h  video  source  can  be  approximated  by  Rk(n)  through  an  appropriate  selection  of  the  arrival 
rates  A*,  1  <  30. 

In  order  to  produce  a  first  approximation  for  the  autocorrelation  function  for  cells/slice  which  is  similar  to 
the  functions  found  in  [24],  for  example,  the  following  distribution  of  the  rates  A*  will  be  considered  3: 

=  =  /  =  1,2 . 15.  (19) 


J  Noic  that  there  exist  many  possible  choices  for  the  rate  distribution  function  Af.  The  distribution  given  in  Eq.  (19)  is  intended  only  to 
serve  as  an  example,  and  is  not  intended  to  closely  match  any  experimental  data  set. 
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Fig.  10.  Sub-period-level  autocorrelation  function  for  A*  -  0.2.  T%  -  327  and  various  values  of  b. 


Upon  choosing  a  value  of  b  to  produce  the  desired  profile  for  the  autocorrelation  function,  the  constant  a  is 
calculated  using  the  expression 

A*=—  Ee-'-1’*.  (2°) 

15  | 

which  is  derived  from  (!)  by  requiring  that  the  mean  rate  over  states  I  through  15  (which  equals  the  mean  rale 
over  states  16  through  30)  be  equal  to  the  frame  cell  rate  A*  for  the  source.  Note  that  according  to  (4),  the 
autocorrelation  function  for  cells/sub-period  is  dependent  upon  both  Tx  and  the  sub-period  rates  A*.  Fig.  9 
shows  the  resulting  sub-pcriod-level  autocorrelation  functions  for  a  source  with  frame  cell  rate  A*  =  0.2,  b  =  0.2 
and  different  values  of  T%  ranging  from  1  to  327.  The  latter  corresponds  to  a  video  source  being  multiplexed 
onto  a  100  Mbps  ATM  link  (see  Eq.  (18)). 

Similarly.  Fig.  10  presents  sub-pcriod-level  autocorrelation  functions  for  the  source  with  Tt  —  327  and 
different  sub-period  rate  distributions  resulting  by  varying  b  from  0.01  to  0.20.  Note  that  as  b  approaches  0,  the 
GPM  source  looks  more  and  more  like  an  i.i.d.  cell  arrival  process.  In  fact,  when  b  -  0,  the  GPM  source  is 
simply  a  binomially  distributed  batch  arrival  process.  It  is  clear  from  Figs.  9  and  10  that  similar  sub-pcriod-level 
autocorrelation  functions  can  be  produced  by  either  fixing  b  and  varying  7^.  or  fixing  T%  and  varying  b. 

42.  Numerical  results  and  discussion  for  the  VBR  application 

In  this  section  some  numerical  results  are  presented  for  the  analysis  of  the  finite  capacity  multiplexer  with  an 
input  process  consisting  of  the  superposition  of  N  GPM  traffic  sources,  which  arc  adopted  as  explained  in  the 
previous  section  to  mode!  VBR  video  sources  pre-buffered  at  the  slice  level.  The  stationary  queue  distributions 
computed  using  the  techniques  presented  in  Section  2.4  yield  queueing  results  for  video  sources  generating 
traffic  according  to  a  single  average  frame  rate.  In  order  to  compute  the  queue  length  distributions  over  all 
possible  frame  rates,  a  frame  rate  histogram  can  be  derived  as  described  in  [9],  and  the  quasi-static 
approximation  presented  there  can  be  applied.  Application  of  the  quasi-static  approximation  is  beyond  the  scope 
of  this  paper.  The  primary  objective  here  is  to  determine  the  impact  of  periodic  slice-level  autocorrelations  on 
queue  response. 

To  address  the  effectiveness  of  the  slice-level  autocorrelation  function  in  describing  a  VBR  input  source, 
consider  the  functions  pictured  in  Fig.  1 1.  Although  the  two  autocorrelation  functions  are  almost  identical,  one 
is  produced  by  an  underlying  periodic  Markov  chain  which  makes  transitions  every  time  slot  (Ts  =  1),  while  the 


306 


R.  Landry .  /.  Staurakakis /Computer  Networks  and  ISDN  Systems  29  (1997)  293-3 I 3 


Fig.  I  I.  Sub-period-level  autocorrelation  functions  for  sources  with  A*  -  0.2.  and  different  values  of  Tt. 


other  is  produced  by  a  source  making  transitions  every  20  slots  (T%  =  20).  The  autocorrelation  functions  arc 
matched  by  simply  varying  the  distribution  of  sub-period  rates  through  the  parameter  b\  the  total  rate  for  each 
GPM  source  is  A*  =  0.2.  The  queue  response  to  these  sources  is  examined  by  considering  a  multiplexer  of 
capacity  £  =  30  cells  with  an  input  of  4  of  the  GPM  sources.  The  input  sources  with  Ts  =  20  slots  have  a  much 
more  negative  impact  on  the  multiplexer  than  those  with  1.  which  is  illustrated  in  Fig.  12  through  tail 
distributions  for  buffer  occupancy  as  well  as  loss  probabilities  L  It  is  clear  that  simply  matching  the 
autocorrelation  function  for  cells/slice  is  not  sufficient  to  estimate  multiplexer  performance.  The  actual 
time-scale  associated  with  the  slice  needs  to  be  considered  carefully. 

In  fact,  by  adopting  different  source  time-scales  T%%  performance  studies  can  lead  to  contrasting  conclusions 
about  the  feasibility  of  multiplexing  VBR  video  which  is  not  pre-buffered  at  the  frame  level.  For  instance,  based 
on  numerical  results  for  input  sources  with  periodic  autocorrelations  which  make  transitions  every  time  slot,  the 
author  in  [25]  concludes  that  loss  probabilities  are  higher  when  VBR  video  sources  arc  pre-buffered  and 
smoothed  on  a  frame  by  frame  basis.  A  similar  conclusion  would  be  drawn  using  GPM  sources  with  Tt  =  1.  For 


Fig.  12.  Tail  distributions  and  loss  probabilities  for  matching  autocorrelation  functions  with  Tx  -  l  and  20:  total  load  is  A  -  0.8  in  both 


cases. 
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Fig.  13.  Cell  loss  probabilities  for  N  -  4  GPM  sources,  5  -  40  and  7*,  of  I  and  20;  total  load  is  A  -  0.88. 


GPM  input  sources,  a  frame-level  pre-buffering  scheme  is  modeled  by  setting  constant  arrival  rates  over  all  30 
sub-periods  (or  slices),  which  is  the  case  when  b  =  0  under  the  rate  distribution  strategy  of  Eq.  (19).  Note  that 
this  eliminates  the  periodic  nature  of  the  slice-level  autocorrelation  function. 

In  Fig.  13  cell  loss  probabilities  are  plotted  versus  b  for  N  =  4  input  sources,  each  with  rate  0.22,  at  a 
multiplexer  of  capacity  B  =  40  for  7,  =  1  and  7,  =  20.  As  expected,  when  b  =  0  (which  models  frame-level 
pre-buffering)  the  rates  do  not  change  from  slice  to  slice  and  the  loss  probability  docs  not  depend  upon  7,. 
When  7=1,  notice  that  loss  probabilities  decrease  as  the  periodic  fluctuations  in  arrival  rate  increase  with  b. 
Therefore,  use  of  this  model  would  lead  to  the  false  conclusion,  as  in  (25],  that  the  periodic  autocorrelations 
provide  for  improved  statistical  multiplexing  gain  over  frame-level  pre-buffering.  As  7,  is  increased  to  as  little 
as  20  slots,  which  corresponds  to  an  output  link  of  capacity  C  =  6.1  Mbps  (Eq.  (18))  for  the  adopted  GPM 
model,  numerical  results  for  L  lead  to  a  completely  different  conclusion.  In  this  case  it  would  seem  that 
smoothing  the  periodic  autocorrelations  through  frame-level  pre-buffering  provides  greatly  improved  QoS. 
These  results  illustrate  the  importance  of  incorporating  time-scales  into  traffic  models  in  order  to  accurately 
assess  the  impact  of  the  modeled  traffic  sources  on  network  performance. 

In  order  to  investigate  a  more  realistic  multiplexing  scenario.  Table  I  depicts  cell  loss  probabilities  for  a 
buffer  capacity  of  B  =  50.  an  output  link  capacity  of  100  Mbps,  and  an  input  consisting  of  10  identical  VBR 
sources  each  transmitting  at  a  rate  of  8  Mbps.  This  scenario  can  be  modeled  with  GPM  sources  through  the 
following  choice  of  parameters:  7,  =  327.  A'  =  •  •  •  =  A10  =  0.08  and  of  course  =  »'10  -  30.  As 

explained  in  Section  4.1,  each  source  generates  cells  according  to  the  appropriate  Bernoulli  PMF  to  approximate 
the  smoothing  of  cells  within  a  slice. 

In  Table  1,  three  different  autocorrelation  functions  for  cells/slice  are  considered.  One  set  of  results 
represents  the  frame-level  pre-buffering  case  (b  =  0)  while  the  others  correspond  to  two  of  the  autocorrelation 


Tabic  I 

Loss  probabilities  for  different  values  of  b  under  both  synchronized  and  uniformly  distributed  frame  boundaries _ 

b  L  for  synchronized  frame  boundaries  L  for  uniformly  distributed  frame  boundaries 
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curves  pictured  in  Fig.  10  ( b  =  0.01  and  b  =  0.02).  Clearly,  the  periodic  autocorrelation  function  has  a  negative 
impact  on  the  queue,  and  increasing  the  positive  and  negative  peaks  in  the  autocorrelation  function  by  increasing 
b  results  in  even  worse  performance.  Note  that  the  worst-case  scenario  in  terms  of  possible  starting  times  among 
the  input  sources  is  achieved  when  the  frame  boundaries  of  all  10  sources  arc  synchronized  (c1  «  •  •  •  =  c10). 

The  impact  of  frame-boundary  synchronization  can  be  minimized  by  uniformly  distributing  the  frame  starts 
(boundaries)  at  intervals  of  [M/Nl  as  is  the  case  in  [11],  which  represents  a  best-case  scenario  in  terms  of 
source  starting  times.  While  ensuring  such  a  uniform  distribution  of  frame  starts  in  practice  is  highly  unlikely, 
the  positive  impact  on  the  multiplexer  is  indisputable,  as  is  seen  in  Table  l,  where  loss  probabilities  virtually 
coincide  for  all  values  of  b. 

Recall  that  the  results  presented  here  are  for  one  possible  combination  of  frame  ceil  rates,  and  that  under  a 
quasi-static  approximation,  all  combinations  of  the  individual  frame  cell  rates  would  be  considered.  However,  it 
should  be  noted  that  the  averaging  effect  of  the  quasi-static  approximation  may  mask  the  extreme  queueing 
conditions  that  occur  over  time  horizons  for  which  the  offered  load  is  high.  Hence,  it  may  be  more  reasonable, 
when  applying  the  quasi-static  approximation,  to  consider  only  the  “high-rate"  bins,  which  would  yield  a  more 
detailed  account  of  the  queueing  behavior  during  the  times  in  which  the  QoS  requirements  for  the  VBR  sources 
are  most  likely  to  be  violated.  It  is  possible  that  these  QoS  violations,  which  may  be  sufficiently  rare  when 
considered  over  all  possible  combinations  of  frame  cell  rates,  are  severe  enough  during  the  high-load  periods  to 
disrupt,  or  at  least  impair,  service. 


5.  Conclusions 


In  this  paper,  the  concept  of  source  time-scales  was  applied  to  the  discussion  of  traffic  sources  in  broadband 
packet  networks  such  as  ATM.  A  Generalized  Periodic  Markovian  (GPM)  traffic  model  was  introduced  to 
capture  the  cell  arrival  dynamics  associated  with  these  time-scales,  and  a  finite-capacity  multiplexer  with  an 
input  process  consisting  of  the  superposition  of  N  GPM  sources  was  analyzed  in  discrete  time.  An  efficient 
numerical  technique  was  presented  for  the  solution  of  a  potentially  large  irreducible  Markov  chain  by  solving 
smaller  chains  with  state  space  size  independent  of  the  number  of  states  in  the  input  process. 

As  expected,  numerical  results  suggest  that  the  time-scales  at  which  variations  in  the  cell  arrival  process 
occur,  impact  significantly  on  the  queue  response.  This  fact  seems  to  indicate  that  these  relevant  lime-scales, 
which  change  with  the  link  speed,  should  be  considered  as  important  traffic  descriptors,  and  incorporated  into 
source  models  in  high-speed  environments. 

The  proposed  GPM  traffic  source  model  and  developed  analytical  technique  were  applied  to  the  study  of 
Non-Deterministic  Periodic  (NDP)  traffic  sources  as  well  as  to  VBR  video  sources  which  deliver  cells  at  the 
slice  level,  as  opposed  to  smoothing  and  delivering  them  on  a  frame  by  frame  basis.  Results  from  the  NDP 
source  study  indicate  that,  depending  upon  the  operating  conditions  at  the  finite-capacity  multiplexer,  the  period 
of  the  NDP  input  streams  may  have  either  a  positive  or  negative  impact  on  queueing  performance,  relative  to 
that  of  a  M/D/l/B  queue.  This  implies  that  M/D/l/B  results  should  not  be  used  for  bounding  loss 
probabilities  when  studying  the  queue  response  to  NDP  input  processes.  Furthermore,  the  asymptotic  (large  /V) 
behavior  of  NDP  input  sources  was  shown  to  be  poorly  approximated  by  the  Poisson  process  when  the  potential 
cell  generation  times  occur  periodically  with  period  greater  than  1  (i.e.  m  >  1). 

Results  from  the  VBR  video  application  illustrate  that  the  inter-slice  autocorrelation  function,  which  is 
periodic  in  nature,  has  a  major  impact  on  the  finite-capacity  queue.  The  video  application  was  based  on  the 
assumption,  first  made  in  [9],  that  the  system  can  be  approximately  analyzed  by  independently  studying  the 
queue  response  to  different  frame  cell  rates  and  then  conditioning  over  all  rates.  Numerical  results,  which  were 
derived  for  a  single  frame  cell  rate,  indicate  that  frame-level  pre-buffering  provides  improved  QoS  when 
slice-level  autocorrelations  are  periodic. 
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Appendix  A.  Derivation  of  the  sub-pcriod-Icvcl  autocorrelation  function  R(n) 


This  appendix  is  concerned  with  the  derivation  of  R(n),  defined  as 

Ej  A.- -E2U„-} 


*(«)  = 


E{A2n.}-E2(A„.) 


(A.I) 


The  quantity  E(  A„.  A„.  +  n}  will  be  derived  by  considering  the  cases  n  =  0  and  n  >  0  separately. 
For  n  —  0, 

e{a;.}=  E*2  Pr(A-^k) 

k 

=  E  E  *2  Pr{  K-  = k  I  tv  = A.)  Pr(  tv =  A.)  • 

i  k 

The  second  moment  of  An> .  given  will  be  denoted  by  where 
ti-  E*2  Pr{  A„>  =  k  |  y,.  =  A,}. 


(A.2) 


(A.3) 


This  quantity  can  be  expressed  in  terms  of  the  GPM  parameters  defined  earlier  once  the  cell  arrival  PMF 
/■(•),  1  <  /  <  m,  is  known.  For  instance,  if  /,( • )  is  a  Bernoulli  PMF  with  rate  A,.  then  Pr{A„.  =  A'ly,.  =  A,)  is 
simply  a  binomial  distribution  given  by 


T,-k 


Pr{A„.  =  *|y,.  =  A,}  =  |^‘J(A,)‘(I  “  A,) 
In  this  case,  it  is  easy  to  show  that  is  simply 

fi“7;2AJ  +  r,A,(i  -a,). 

and  (22)  can  be  expressed  as 

E{  A2  }  =  n  [T^A2  +  7,  A,(  1  -  A,)]  77,. 

i 

=  Ti2L^i  +  TtA-T,Z^r 


For  n  >  0, 

E(A,Mj  =  EEMt  Pr {*„•  =  *,.  A.+^k2) 

k,  *, 

=  E  E  E  EM*  Pr{*.-  =  I  Tv”  A,}  Pr{  A„-  +  „  =  ki  I  7V  +  n  =  Ay} 

i  j  *,  k2 

•Pr{TV+„  =  A;l  y,A,}  Pr(TV  =  a,} 

=  P,JEE  A,  A  jp\*}ir,. 

i  i 

From  (A.6)  and  (A.7),  the  quantity  E{AB.A„.  +  n).  for  n>  0,  can  be  expressed  as 
E{  A,A.  +  n)  =  T,2E  ZAPjP^  +  rh  - 

i  j  i 

where  5„  is  the  Kroncckcr  delta. 


(A.4) 

(  A.5) 


(A.6) 


(A.7) 


(A.8) 
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This  expression  can  be  further  simplified  by  noting  that,  due  to  the  periodic  structure  of  the  Markov  chain 
considered  here,  the  stationary  probability  7 r,  is  simply  I /m,  where  rn  represents  the  number  of  stales  in  the 
periodic  Markov  chain.  Also  note  that  the  one-step  transition  probabilities  for  this  source  are  defined  by 


Pi.r 


_  f  I  if  1  <  x  <  m  and  y  -  ( / )  mod  ( m)  +  I , 


0  otherwise. 


(A.9) 


{n)  _  /  1  if  I  <  /  <  m  and  j  =  ( /  +  rt  -  1 )  mod  ( m  +  1 ) , 
\o  otherwise. 


From  these  quantities  it  is  an  easy  computation  to  show  that  the  n-step  transition  probabilities  are  given  by 

(A. 10) 

Eq.  (A.8)  can  now  be  rewritten  as 

t  1  "  \ 

( A.  1 1 ) 


J  2  m  1  m 

E(  An-  An'  +  n)  -  —  53  A*  A‘  +7  A - E 

rtf  ^  rtf  I 


i-  l 


where  j  ~  (/  +  n  —  1)  mod(m)  +  1. 

Finally,  E{A„.}  can  be  derived  as  follows: 

EM„.}  =  E*PrM„.  =  *} 

* 

=  E  E*  Pr(  A.-  =  x- 1  y„.  =  A,}  Pr(  %.  =  A,} 

i  k 

=  ET'.A.tt, 

i 

=  7,A.  (A. 12) 

By  substituting  (A.ll)  and  (A.12)  into  (A. I),  the  final  expression  for  the  autocorrelation  function  R(n)  is 
obtained: 


1  ”  1 

—  E  A,  A  +  — 

m  i- 1  r« 

(  \  m  y 

A - E  A? 

1  m  i- 1  J 

|*„-AJ 

1  "  ,  1 
-  Ea  +- 

m  i- 1  T> 

'  1  ”  J 

A  -  -  E  A 

i-  1  j 

-A2 

*(«)- 


where  again  y  =  (1  +  n  —  1)  mod(m)  +  1. 


(A.13) 


Appendix  B.  Derivation  of  the  quantities  iri  j  and  L 

Let  Xn  denote  the  process  { qn ,  jr„}n>0,  where  n  is  indexed  at  slot  boundaries.  Recall  that  the  process 
xkh>  i’  w^ich  has  been  considered  for  the  analysis  of  the  GPM  sources  at  a  finite-capacity  multiplexer,  is 
indexed  at  sub-period  boundaries;  the  length  of  a  sub-period  is  equal  to  7S  slots.  The  Markov  chain  {qk,  *A}4  >  , 

has  state-space  S  =  {(0,  1) . (£,  A/)}.  Note  that  { qk ,  xk}k> ,  is  embedded  in  the  process  [qn%  jc„}„>0,  anc* 

t  AT( a >  ,  =  (qk%  **}*  >  ,. 

This  appendix  is  concerned  with  the  computation  of  the  stationary  probabilities  for  the  process  X„,  denoted 
by  7 t.j  where 

lim  Pr{<7„  =  i,  Jr„=y}, 

n  -»  x 

as  well  as  the  cell  loss  probability  L. 
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Let  <£  denote  some  event  of  interest  which  can  be  observed  at  slot  boundaries.  Pr{</>)  will  denote  the 
probability  of  this  event.  Let  denote  the  number  of  occurrences  of  <l>  when  the  Markov  chain 

(<7*.  i  is  in  state  ('•  y)eS.  Nj  ji <f> )  will  denote  the  expected  value  of  this  quantity.  If  ■ni  j  denotes  the 
stationary  probability  that  the  Markov  chain  [qk,  ,  is  in  state  (/.  j)  £  5.  then  it  can  be  shown  [26]  that 

<  j 

Computation  of  7 r.  ..  Let  </>  represent  the  event  that  the  system  is  found  in  state  (/.  j)  at  an  arbitrary  time 
instant,  and  let  N.  .{i.  j)  denote  the  number  of  times  the  system  is  in  state  (/.  j)  over  the  current  sub-period, 
given  that  the  sub-period  began  in  state  (i\  /).  Also  denote  by  j)  an  indicator  function  which  takes  the 

value  1  if  the  system  is  in  state  (/,  j)  after  n  time  slots  in  the  sub-period  which  began  in  state  (/',  /);  the 
quantity  equals  0  otherwise.  Clearly,  ;*(/\  j )  can  be  expressed  as 

t; 

AV;(/.  j)  =  E  J)- 

n  -  | 

The  expected  value  of  j)  is  given  by 

R-A>-  J)  =  E  £{/;.,(/.;)} 

n  —  l 

T% 

=  E  Pr{  <?„  =  /.  jr,  =  y  I  q0  =  i'.  x0=j) 

n  —  I 


Z  pYV- 


#1-  l 


Finally,  a  direct  application  of  Eq.  (B.l)  yields  the  result 


*i.j  =  7  E  E  Pj"\ • ' ) ,r.*./  for(  *  •  J) G  s- 

A  n-  1  r- 0 


(B.2) 


Computation  of  L.  In  this  case,  <£  is  the  event  of  a  cell  being  lost  in  an  arbitrary  time  slot.  By  definition,  the 
loss  rate,  or  cell  loss  probability  L ,  is  given  by 


PrU[ 

A  ’ 

where  A  is  the  total  offered  load,  or  the  probability  that  a  cell  arrives  in  an  arbitrary  time  slot.  N({  will  denote 
the  total  number  of  cells  lost  over  the  sub-period  which  began  in  state  (»',  /);  N--  will  denote  its  expected 
value.  Let  t".  denote  the  number  of  cells  lost  over  the  nth  slot  of  this  sub-period  and  let  an  denote  the  number 
of  cell  arrivals  over  the  nth  slot.  It  is  easy  to  sec  that 


k  U  an  =  B  +  \  -  q„_ ,  +  k, 
0  otherwise. 


Clearly, 


N,-f  =  E  A"/- 


n  -  l 
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and 

"tr-  i  E(/"/} 

n  -  1 

T. 

=  E  E*  Pr{a„  =  5+  1  +  *l<70  =  ''-  *o“/} 

n  -  1  t 

r,  a 

=  E  E  E*  Pr{a„  =  B  +  *  -  i  +  k\q0-r.  x0=/}  Pr[qn_l=*i\q0  =  t  .  -r0=/} 

n-  I  i-0  * 

=  E  E  E  - 0//(fl +  >-/  +  *)• 

n-  1  i-O  k-Kt 

where  Kl  is  the  minimum  number  of  losses  which  can  occur  over  the  given  time  slot,  and  K2  is  the  maximum 
number  of  such  losses.  It  is  easy  to  see  that  K ,  =  max(0,  /  -  B  -  1)  and  K2~  N  -  B  -  1  +  /. 

Finally,  applying  Eq.  (B.l)  yields 

^  =  t!tE  E  E  E  E  ¥/“ «)/}( «+>-'  +  *Kv  (B-3) 

r-o/-i  n-1 1- /,  *-*, 
where  /,  =  max(0,  B  +  2  -  N). 
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Abstract 


Sinele  server,  discrete-time  queueing  systems  under  independent  or  Markov  arrival  processes  and  deterministic  service  require¬ 
ments  have  been  studied  widely  in  the  past.  The  fixed  service  time  is  usually  set  equal  to  one  (the  network-time-constant),  defining  the 
time  unit  of  the  discrete-time  system  axis.  Such  queueing  systems  have  been  adopted  for  the  modeling  of  statistical  multiplexers  in 
slotted  communication  networks  supporting  sources  of  (fixed  size)  packetized  information.  In  this  paper,  a  statistical  multiplexer  fed 
by  Markov  packet  sources  is  considered.  Unlike  previous  considerations,  the  time-constant  of  the  sources  is  assumed  to  be  greater 
than  one.  As  a  result,  packets  cannot  be  generated  over  consecutive  discrete-time  instants  whose  distance  is  less  than  the  source  time- 
constant,  although  the  source  may  be  active;  both  the  cases  of  Markov  sources  with  single  and  double  time-constants  are  considered. 
The  resulting  packet  traffic  is  characterized  and  the  formulated  queueing  model  is  studied.  In  addition  to  presenting  an  analysis  of  a 
statistical  multiplexer  fed  by  quite  general  Markov  traffic  streams,  this  study  concludes  that  the  negative  impact  of  source  traffic- 
correlations  on  statistical  multiplexing  can  be  reduced  substantially  when  sources  with  time-constants  greater  than  one  are  con¬ 
sidered.  Even  in  the  absence  of  correlations,  the  guaranteed  idleness  of  the  source  within  a  source-time-constant  s  interval  is  shown 
to  have  a  positive  impact  on  the  multiplexing  process.  Finally,  some  numerical  results  for  the  case  of  multiplexed  packetized  voice 
sources  are  also  presented. 


Keywords:  Statistical  multiplexing:  Time  constraints:  Markov  traffic  streams 


I.  Introduction 

The  probabilistic  nature  of  the  traffic  generated  by 
users  of  a  networking  facility  renders  inefficient  the 
exclusive  allocation  of  the  resources  to  individual  users 
over  a  certain  time  horizon.  Some  type  of  statistical 
multiplexing  of  the  information  generated  by  different 
sources  is  usually  employed  at  the  access  points  of  a  net¬ 
work.  In  this  paper  /V  sources  of  (fixed  size)  packetized 
information  are  multiplexed  before  accessing  a  fixed 
speed,  slotted  transmission  line  (Fig.  1).  The  constant 
packet  transmission  time  (slot)  is  assumed  to  be  equal 
to  one  and  is  called  the  network  time-constant  or  the 
network  (output)  time  slot.  The  slot  boundaries  of 
the  output  transmission  line  define  the  discrete-time 
axis  of  the  system.  Packets  are  temporarily  stored  in  a 
buffer  until  the  output  transmission  line  is  available. 
Packet  arrivals  and  departures  are  assumed  to  occur  at 
the  discrete-time  instants. 


The  above  statistical  multiplexer  has  been  studied  in  - 
the  past  under  correlated  packet  arrival  processes.  The 
most  relevant  work  can  be  found  elsewhere  [  1  —  1 0]  (and 
the  references  cited  there).  In  Heffes  and  Lucantoni  [1], 
the  aggregate  traffic  generated  by  multiple  independent 
input  lines  is  modeled  as  a  2-state  Markov  Modulated 
Poisson  process  using  moment  matching  techniques. 
Then,  the  resulting  statistical  multiplexer  is  studied.  In 
Towsley  [2],  the  author  considers  a  single  input  line 
and  arrivals  which  depend  on  an  underlying  two  state 
Markov  chain.  In  Bruneel  [3],  Viterbi  [4],  and  Reiser  [5], 
N  input  lines  are  assumed  present.  In  Bruneel  [3],  it  is 
assumed  that  the  packet  arrival  process  in  each  of  the 
identical  input  lines  depends  on  an  underlying  two 
state  Markov  chain  (active/inactive).  In  Viterbi  [4]  it  is 
assumed  that  the  per  line  packet  arrival  process  is  a  first 
order  Markov  chain,  and  at  most  one  packet  arrival  is 
possible.  A  closed  form  solution  for  the  mean  packet 
delay  has  been  derived  for  the  latter  case.  In  Reiser  [5), 
a  closed  form  expression  for  the  mean  packet  delay  in 
the  case  of  Bernoulli  per  line  arrivals  can  be  found.  The 
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discrete  time  system  considered  in  Ali  Khan  [6,  7]  corres¬ 
ponds  to  a  statistical  multiplexer  where  packets  arrive 
through  a  single  input  line  and  they  form  a  Markov 
chain.  Finally,  statistical  multiplexers  under  discrete¬ 
time  phase-type  arrival  processes  have  been  considered 
[8-10].  More  references  on  queueing  models  for  statisti¬ 
cal  multiplexers  may  be  found  in  Daigle  [1 1]. 

Extensive  previous  work  on  similar  queueing  systems 
in  a  continuous  time  set  up  can  be  also  found  in  the 
literature.  In  Anick  el  al.  [12],  N  identical  on-off  sources 
with  exponentially  distributed  on  and  off  periods  are 
considered.  The  case  of  A’  identical  input  lines  with  expo¬ 
nentially  distributed  idle  periods  and  arbitrarily  distrib¬ 
uted  active  periods  is  studied  in  Refs.  [13,  14].  A  similar 
model  when  the  input  lines  are  not  identical  has  been 
considered  [15].  Other  continuous  time  models  in  a 
multi-entry  buffer  system  have  been  considered  [16,  17]. 
Finally,  when  the  Markov  processes  describing  the  per 
input  line  packet  arrivals  is  a  reversible  process,  a  decom¬ 
position  method  that  circumvents  the  state  explosion 
problem  has  been  presented  [18]. 

The  correlated  (Markov  dependent)  discrete-time 
packet  arrival  processes  considered  in  the  past  is  typically 
assumed  to  be  generated  by  correlated  sources  whose 
time-constant  is  equal  to  the  network  time-constant; 
the  source  time  constant  is  defined  as  the  minimum 
time  interval  between  consecutive  packet  generation 
instants.  In  this  paper,  the  packet  sources  are  assumed 
to  be  correlated,  as  determined  by  a  first-order  Markov 
model.  Unlike  most  previous  considerations,  the  transi¬ 
tion  time  of  the  Markov  model  (source  time-constant)  is 
greater  than  the  packet  transmission  time  (network  time- 
constant).  The  resulting  traffic  is  more  complex  and 
defines  a  queueing  system  which  is  different  from  those 
considered  in  the  past  under  correlated  traffic.  A  relevant 
queueing  system  has  been  studied  [19,  20]  under  non- 
Markov  correlated  arrivals.  Approximate  and  asympto¬ 
tic  studies  involving  slow  sources  with  Markov  arrival 
processes  have  been  reported  [24,  25]. 

There  arc  many  potential  applications  of  the  queueing 
system  presented  in  this  paper.  It  may  be  adopted  for 
the  modeling  of  a  multiplexer  at  the  access  points  of 
high  speed  networks  receiving  traffic  from  lower  speed 
networks  or  from  sources  with  a  slow  packet  genera¬ 
tion  mechanism  compared  to  the  network  slot.  Due  to 


Fig.  2.  The  first-order  Markov  model  for  the  packet  source. 


limitations  on  the  information  processing  speed,  the 
time-constant  of  most  information  sources  may  be  larger 
than  that  of  a  high  speed  (fiber  optics)  networks. 
Furthermore,  earlier  developed  (low  speed)  networks 
behave  like  slow  information  sources  at  the  access 
point  of  a  high  speed  backbone  network.  Similar  queue¬ 
ing  systems  appear  in  a  computer  processor  or  a  produc¬ 
tion  line  where  jobs  delivered  to  a  server  require  a  pre¬ 
processing  time  greater  than  the  processing  time  at  the 
server.  Some  results  on  the  impact  of  different  network 
transmission  speeds  under  Markov  formulation  and  by 
employing  the  M  G/1  paradigm  [23]  can  be  found  in 
Daigle  [11]. 

In  the  next  section,  the  packet  arrival  processes  gener¬ 
ated  by  some  types  of  slow  sources  arc  described,  and 
a  Markovian  model  is  constructed  for  the  tractable 
description  of  these  processes.  In  Section  3,  a  general 
queueing  model  is  formulated  and  analyzed.  This 
model  is  employed  in  the  study  of  the  multiplexer 
under  the  slow  arrival  processes.  Although  the  analysis 
presented  here  is  different,  the  formulated  queueing 
model  can  be  seen  to  fall  into  the  general  M/G/l  para¬ 
digm  [23].  Numerical  results  are  presented  in  the  last 
section,  together  with  a  discussion  on  the  impact  of 
the  correlation  and  the  source  time-constant  on  the 
queueing  behavior. 


2.  Source  packet  traffic  and  the  formulated  queueing 
systems 

Consider  a  fixed-speed  slotted  network  line  that 
supports  the  transmission  of  packelized  information 
generated  by  A’  sources.  The  network  slot  (or  network 
time-constant)  is  equal  to  the  packet  transmission  time. 
Packet  transmissions  are  synchronized  with  the  network 
slot  boundaries.  The  sequence  of  the  slot  boundaries 
defines  the  discrete-time  axis  of  the  system,  denoted  by 
J;  J  =  {0. 1.2. The  constant  length  packets  generated 
by  the  sources  are  temporarily  stored  in  a  buffer  of  infi¬ 
nite  capacity.  The  formulated  queueing  system  can  be 
modeled  in  terms  of  a  single  server,  First-Come  First- 
Served  (FCFS).  discrete-time  queueing  system  with 
deterministic  and  equal  to  one  slot  service  time,  Fig.  1. 

Let  Sk  denote  the  kih  packet  source;  a  superscript  k 
will  denote  a  quantity  associated  with  the  Alh  packet 
source  for  1  <  k  <  N.  Each  of  the  packet  sources  is 
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assumed  to  be  oil  nor  active  (state  1)  or  inactive  (state  0). 
Transitions  between  the  two  states  occur  according  to  a 
first-order  Markov  model  (Fig.  2).  The  one  step  transi¬ 
tion  time  from  state  0  and  state  l  may  or  may  not  be 
identical,  resulting  in  single-  or  double-time-constant 
sources. 

2.1 .  Single-tinw-constant  sources:  asymmetric  case 

Consider  S  single-time-constant  sources  of  packetized 
information.  As  will  be  shown  below,  the  packet  tralFic 
generated  by  the  Ath  sources  will  be  described  in  terms 
of  the  parameters  {rr*(  l ). ~ A.  r\ c' }  defined  as  follows, 
for  1  <  A  <  .V: 

t ;(/j:  It  denotes  the  steady  state  probability  that  the 
A!h  Markov  chain  is  in  state  /.  /  €  {0.  I }. 

- It  denotes  the  (source)  burstiness  coefficient 
defined  by  />*(!.  1) 0.1):  p'ti.j)  denotes 
the  transition  probability  that  the  Ath  Markov 
chain  moves  from  state  i  to  state  j.  i.j  G  {0.  1}. 

r;:  It  is  dehned  as  the  source  time-constant  and 
it  is  equal  to  the  one-step  transition  time  of  the 
A!h  Markov  chain:  rk  G  Z“,  where  Z"  is  the  set 
of  the  positive  integers.  A  transition  is  assumed 
to  occur  in  the  chain  every  rK  time  units; 
the  same  state  may  be  revisited  at  the  end  of 
the  transition. 

ck:  It  is  a  constant  which  denotes  the  time  instant 
when  the  first  transition  of  the  Ath  Markov 
chain  occurs;  0  <  ck  <  Tk  -  1.  ck  G  Z°.  where 
Z°  is  the  set  of  the  non-negative  integers. 

Note  that  the  steady  state  and  the  transition  prob¬ 
abilities  associated  with  the  Alh  source  can  be  obtained 
from  t*(  1 )  and  7*  by  incorporating  the  Markov  proper¬ 
ties.  The  sequence  of  time  instants  Tk .  TK  C  7.  at  which 
transitions  of  the  Alh  Markov  chain  occur  is  given  by 
the  set 

r‘4Sy:^€Z°) 

The  packet  traffic  {Ak},aj  delivered  by  the  A,h  source  is 
a  discrete-time  process  defined  in  terms  of  the  (under¬ 
lying)  Markov  packet  generating  mechanism.  Packets 
are  assumed  to  be  generated  at  the  end  of  a  one-step 
transition  interval  of  the  Markov  chain  associated  with 
the  source,  as  a  result  of  the  source  activity  during  that 
interval.  That  is,  they  may  be  generated  only  at  some  time 
instant  tK%  tk  G  Tk .  Let  /  be  the  state  of  the  Ath  Markov 
chain  at  ik  -  Ar,  tk  G  Tk%  0  <  A/  <  l.  The  number  of 
packets,  n.  generated  by  the  source  at  tk  is  described  by 
the  probability  mass  ok(i,n),  /G  {0.1},  n  G  Z°.  It  is 
assumed  that  no  packets  are  generated  when  the  source 
is  inactive.  That  is,  o*(0,0)  =  L  1  <  A  <  .V. 

In  view  of  the  above  discussion  {  A k  }}zj  is  completely 
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Fig.  3.  A  realization  of  the  arrival  process  (.if  ({,./  .  n>r  r*  -  Z. 
vl  =  1;  *  denotes  potential  arrival  points. 


determined  in  terms  of  the  parameters 
{-*(n.-j4.r4.f*.o*(I.»)  tor  (2) 

Notice  that  the  sequence  of  time  instants  of  potential 
packet  arrivals  from  the  Ath  source,  Tk ,  is  completely 
determined  from  Eq.  (2).  The  ^ame  ln>lds  true  lor  the 
set  of  time  instants  Lk  at  which  it  is  guaranteed  that  no 
packet  will  be  generated  by  the  Ath  source.  L '  is  given  by 

Lk  =J-Tk  =  Z" 

Note  that  if  j  G  Lk  then  j  4-  nr '  G  Lk  as  well,  for  n  G  Z°. 

The  packet  arrival  process  {.-ly}/r-y  determined  by 
Eq.  (2)  is  different  from  those  associated  with  similar 
queueing  systems  that  have  been  studied  in  the  past. 
When  rk  =  l  (which  implies  that  ck  =  0). 
becomes  the  standard  Markov  Modulated  Generalized 
Bernoulli  (MMGB)  process;  here,  the  term  standard 
implies  that  the  one-step  transition  time  is  equal  to  one 
time  unit.  Of  course,  {/l*}/€/  for  rk  >  l  may  be  seen  as 
a  standard  MMGB  process  pro\ided  that  the  time  unit 
of  the  system  be  equal  to  rk.  When  packet  sources  with 
different  time-constants  are  involved,  the  selection  of  the 
network  time-constant  for  the  construction  of  the  system 
time  axis  J  results  in  convenient  characterization  of  both 
the  arrival  and  the  departure  (from  the  buffer)  processes. 
One  departure  occurs  at  every  time  instant  j  G  7,  pro¬ 
vided  that  the  buffer  is  non-empty.  In  this  paper  it  is 
assumed  that  the  time-constant  of  the  sources  are  integer 
multiples  of  the  network  time-constant.  Thus,  all  arrivals 
will  occur  at  some  lime  instant  in  7.  A  realization  of  an 
arrival  process  { ^ y } >  e  y  f°r  r*  =  -  ancl  ^  =  l  is  shown 
in  Fig.  3. 

A  packet  source  that  generates  an  arrival  process 
described  by  Eq.  (2)  for  some  rk  >  l  is  considered  to 
be  slow  (with  respect  to  the  network  time  constant) 
and  correlated.  Such  packet  sources  may  be  adopted 
for  the  modeling  of  packet  arrival  processes  appearing 
in  communication  networks.  For  instance,  the  packet 
traffic  delivered  by  a  (slow)  transmission  line  to  a  multi¬ 
plexer  feeding  a  network  line  which  is  rk  times  faster,  can 
be  described  in  terms  of  a  process  determined  by  Eq.  (2). 
Another  example  is  the  traffic  delivered  to  a  high  speed 
line  by  a  slow  processor.  In  this  case  rk ,  r;  >  1,  des¬ 
cribes  the  processing  time  required  before  an  output 
(possibly  multi-packet)  is  generated. 


/.  Stavrakakis  ' Computer  Communications  19  1 1996  >  U 12-1123 


1 1 15 


It  is  important  to  note  that  the  packet  arrival  process 
K')./^  has  a  structure  which  affects  significantly  the 
buffer  behavior.  This  process  presents  a  sequence  of 
periodic  time  instants  (contained  in  Lk)  at  which  no 
packet  is  delivered.  The  existence  of  these  periodic 
sequences  has  a  positive  impact  on  the  intensity  of  the 
queueing  problems,  as  it  will  be  illustrated.  Furthermore, 
the  larger  the  value  of  rk ,  the  less  significant  will  be  the 
impact  of  the  correlation  in  the  source  on  the  queueinc 
intensity.  When  rk  >  1,  the  correlation  in  the  arrival 
process,  as  seen  by  the  network  (server),  is  said  to  have 
been  reduced,  compared  to  that  present  in  the  corres¬ 
ponding  source.  A  large  value  of  rk  will  allow  the  net¬ 
work  to  transmit  a  large  number  (up  to  rk)  of  packets 
before  the  next  packet(s)  is  delivered  by  the  correlated 
source.  The  special  case  of  the  arrival  process  {Ak}jeJ 
which  is  based  on  an  uncorrelated  packet  generating 
mechanism  is  also  interesting  and  different  from  an 
independent  and  identically  distributed  packet  arrival 
process.  The  latter  process  may  deliver  packets  over 
consecutive  time  instants  in  J ;  the  former  may  not  if 

v  >  1. 

In  the  rest  of  this  section,  the  cumulative  packet  arrival 
process,  where 


^-el  {-*/}/€  J  —  {(^/*  lj . ^y)}/ e  y  be  an 

(N  A-  1  )-dimensional  discrete-time  process;  /j,  !k  c 
{0, 1},  1  <  k  <  A\  is  a  random  variable  describing  the 
state  of  Sk  at  time  instant  j:{Atj}j€j  is  a  periodic 

Markov  chain  with  state  space  {0. 1, _ M  -  1 }  which 

evolves  as  described  below: 


1  =  (•'(,  ~r  I)  mod  Af  with  probability  1.  (8) 


The  state  of  {A/,}/ey  determines  which  of  the  sources 
undergo  a  state  transition  at  the  current  time.  The 


latter  information  together  with  that  provided  by  the 
process  {(/,'. )}, €  ,  make  {A'}y€,  a  Markov 

process.  Let  f>.  ft  =  {0. 1 } v  x  {0,  ] . Af  -  1 },  denote 

the  state  space  of  {.V,},€  j  and  let  Cl.  Cl  =  2sAf. 
denote  its  cardinality.  The  transition  probabilities  of 
are  given  by 

P{(i'-i: . /■'.«).  (i'.P . /'v. «)} 


-  n 


/>*(>'•/*  i 


n 


l.s‘«  A,,. 


I  j  fi  =  (n  -  1 1  mod  M 

U'J'-... 


(i'.i2 . /"./»). 

/■'.«)  €  n 


(9; 


aj  = 


‘V 


(4) 


is  described  in  terms  of  an  appropriate  MMGB  model. 
Let 


4  =  {Sl  :y€  Tk).  j€J.  (5) 

That  is  A,  is  the  set  of  all  sources  whose  underlying 
Markov  mechanism  undergoes  transition  at  time  instant 
./.  These  sources  potentially  generate  packets  at  that  time 
instant.  From  the  periodicity  of  the  set  Tk  (see  Eq.  (3)) 
turns  out  that  A,  is  also  periodic.  Let 

M  =  iC.l/(r'.T: . ,')  (6) 

where  Af  =  LCAf {•  ■  •  }  denotes  the  least  common  multi¬ 
plier  of  the  argument.  It  is  easily  shown  that 

for // 6  Z°  or  Ay  =  A;niodu  for  jeJ. 

(?) 

For  example,  for  A*  =  2  and  r1  =  2,  r:  =  3,  c]  =  0. 
("  ~  0  turns  out  that  Af  =  6  and  An  =  {S].S:}[ 


where  it  is  defined  that 

IT  p  (/,/)=  1  if  Ain+  Kt  =  {o}. 

k  ; S *  e  mod  u 


Notice  that  only  the  transition  probabilities  associated 
with  the  Markov  chains  of  the  sources  in  Xf 

are  considered,  since  only  these  chains  undergo  a  state 
transition  aty  =  (//  -  1 )  mod  M .  The  first  indicator  func¬ 
tion  imposes  the  condition  that  the  stale  of  the  sources 
not  contained  in  A„_t)nixM/  remain  unchanged.  The 
second  indicator  function  imposes  the  requirement  that 
the  Markov  chain  moves  to  the  next  state  as 

determined  by  Eq.  (8). 

The  cumulative  arrival  process  {A')t  e  j  cart  be  deter¬ 
mined  by  the  packet  generation  probabilities 


o(  /*./"- . n:m)  = 


0  oV:-) 

^  «  •  I  •  mod  W 


(w). 


/  ./■ 


/  €  fi,  m  t  Z 


-  vC 


w  here 


(10) 


- 

/ 

0  oV:-) 

L  A..,  „„„„ 

(w)  S  < 

I  for  m  -  0 

A-fold  convolution  of  o A x ;  - ) , 


I  mod  W  ~  {o} 

^  ^  n  -  I :  mod  \f  =  {o} 

for  all*:j*€  A.n.IlmoiJ„.  aim. 


A,  =  {o}.  A;  =  {s'}.  Ai  =  {5:},  A4  =  {S’}. 
A5  =  {o}. 


Note  that  packets  may  be  generated  only  at  the  transi¬ 
tion  instants  of  the  corresponding  Markov  chain  and  not 


at  every  discrete-time  instant  jj  6  J .  at  which  the  source 
is  active. 

The  original  queueing  system  may  now  be  modeled 
in  terms  of  an  equivalent  system  whose  packet  arrival 
process  is  described  by  a  MMGB  process  {.4;};  a  j\ 
is  described  in  terms  of  the  Markov  chain 
{A  )}j  =  j  and  the  packet  generating  probabilities 
o(.v:/m),  x  6  ft  given  by  Eq.  (10).  This  queueing  system 
is  studied  in  the  next  section. 

2.2.  Single-time-constant  sources:  symmetric  case 


;  . /v  r  /  'vo  .  /  i'J  > 

a  time  instant  in  Tul  in  =  0)  in  which  c:^c  the  tradition 
probability  is  given  by  Eq.  (13).  The  second  indicator 
function  imposes  the  requirement  that  the  state  of 
{if}  remain  unchanged  if  (;  +  I)  £  TlJ  (/j==U).  If 
h  £  0  and  /  =  /,  the  transition  probability  is  equal  to 
one,  provided  that  h  =  (/i  4*  1)  mod  M . 

The  cumulative  packet  arrival  process  {A'f}fcj  can 
be  described  in  terms  of  a  MMGB  process  based  on  the 
underlying  Markov  chain  {A^}y^7  and  the  packet 
generating  probabilities  given  by 

out(i.  «;/w) 


When  the  N  packet  arrival  processes  are  identical 
(and  synchronized)  -  that  is,  when  the  parameters 
in  Eq.  (2)  are  identical  for  all  processes  -  then  the 
( .V  -  1  )-dimensional  Markov  chain  can  be 

replaced  by  the  2-dimensional  Markov  chain 

{•*7 = 

is  a  periodic  Markov  chain  defined  as 
with  parameter  M  =  r;  if  is  the  random 
variable  which  describes  the  number  of  active  sources 

at  time  instant  y,  j  €  J .  Let  QtJ ,  ft'J  =  {0.  I . Y}x 

{0.  I . t  —  1},  denote  the  state  space  of  {X‘J}J€J\ 

its  cardinality  f VJ  is  equal  to  (<V  -f  l)r.  The  sets 
Tk(Lk)  defined  by  Eq.  (1)  (Eq.  (3))  are  identical  for  all 
A\  l  <  k  <  iV,  and  they  are  denoted  by  Tld(L:J).  The  set 
of  active  sources,  is  given  by 


A 


(ID 


f  {o}  tfy  €  l 

“A  {s\s2,...,ss}  if y  e  T. 

For  simplicity  it  may  be  assumed  that  ck  =  0  for  all  k . 
1  <  k  <  iV.  In  this  case  Eq.  (II)  becomes 

{o}  if  ymodr  =  0 

(5\5:,  ...,5s}  otherwise. 

The  transition  probabilities  of  [Xf}j  $  j  are  given  by 
=[<?(/. /)  t  (*=<)}  +  = 


x  1  {  n  =  (n  +•  1)  mod  r} »  (1^) 

where  the  transition  probabilities  <7(1.1).  0  <  /,  i  <  <Vf 
of  the  Markov  chain  {lf}k  <=  ru  ~  defined  on  the  transi¬ 
tion  time  instants  in  Tid  -  with  unit-step  transition  time, 
r,  are  given  by 


<7(L0 


k  = 


min  { 1.  i } 

E 


max{0,i  +  i-.V} 


l)p(0,0)v""(,;_*).  (13) 

The  last  indicator  function  in  Eq.  (12)  imposes  the  con¬ 
dition  that  the  Markov  chain  {Mj}j  <=  j  moves  to  the  next 
state  as  determined  by  Eq.  (8).  The  first  indicator  func¬ 
tion  imposes  the  requirement  that  a  transition  occurs  at 


0  for  ni  £  0 

if  (n 

1  for  m  =  0 

if  (n 

[3Io*,,{l  :•)](/«) 

if  n  - 

+  1 )  mod  M  0  or  /  =  0 
-r  l )  mod  M  £  0  or  i  =  0 
h  1=0, 

(14) 


where  0,  denotes  an  /-fold  convolution  and  oiJ(I:-) 
denotes  the  common  packet  generating  probability  asso¬ 
ciated  with  a  single  source,  as  employed  in  Eq.  (2). 

Although  the  cumulative  packet  arrival  process  in 
the  case  of  identical  packet  sources  may  be  described  in 
terms  of  { X} },  €  j  (and  the  corresponding  probabilities 
be  given  by  Eq.  (10)),  the  formulation  of  {A 'f}jej 
results  in  a  state  space  Q,J  whose  cardinality  fi,J 
increases  linearly  with  respect  to  N;  on  the  other  hand, 
ft  increases  exponentially  with  respect  to  N.  As  it  will  be 
seen  in  the  next  section,  the  cardinality  at  the  underlying 
Markov  chain  affects  significantly  the  complexity  of 
the  solution. 


2.3.  Double-time-constant  sources 

In  the  packet  arrival  processes  considered  so  far,  it 
has  been  assumed  that  the  underlying  Markov  mechan¬ 
ism  of  the  source  makes  a  transition  every  rk  time  units. 
As  a  result,  the  set  Tk  determines  the  sequence  of  time 
instants  in  J  at  which  activation  of  the  source  may  occur. 
In  some  applications  it  is  reasonable  to  assume  that  it 
takes  rk  time  units  for  the  source  to  generate  (prepare)  a 
packet,  when  active.  On  the  other  hand,  the  re-activation 
of  the  source  may  occur  at  any  time  instant  and  not 
necessarily  at  some  integer  multiple  of  rk .  A  correlated 
source  with  this  behavior  may  be  described  in  terms  of 
the  Markov  model  shown  in  Fig.  2,  with  two  different 
one-step  transition  times  (double-time-constant  source). 
A  transition  from  state  1  (active)  occurs  every  r*  time 
units  while  a  transition  from  state  0  (inactive)  occurs 
every  one  time  unit.  The  packet  arrival  process  generated 
by  such  a  source  is  described  by  the  parameters  in  Eq.  (2) 
with  the  understanding  that  transition  from  a  state  0 
may  occur  in  the  next  slot.  Under  this  modeling,  the 
re-activation  of  the  source  may  occur  at  any  time  instant 
in  J  and  not  necessarily  at  the  time  instants  contained 
in  Tk.  The  length  of  an  active  period  is  geometrically 
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transition  probabilities 

r'W'j2 . 


p‘(1.0) 

Fig.  4.  The  Markov  model  for  {)'*}(  , «  j\- 

distributed  with  mean  l/p*(l,0)  in  time  units  of  length 
Tk\  the  length  of  an  inactive  period  is  geometrically 
distributed  with  mean  l///(0. 1)  in  system  time  units. 

The  double-time-constant  Markov  source  introduced 
above  may  be  alternatively  described  in  terms  of  the 
Markov  chain  shown  in  Fig.  4.  Let  ft*, 

il\  =  {0,1 . r*}  denote  its  state  space  whose  car¬ 

dinality  Clk  is  equal  to  t*  +  1.  Let  pk{ij).  i.j  €  fi*. 
denote  the  transition  probabilities  of  {}k}t£j\  lhe 
non-zero  such  probabilities  are  given  by 

pk{i-j)  —  ••  j  —  ‘  +  !•  1  -  '  <  T ^ ’ 

/»f(r4.l)  =  1  -pk(\.0).  (15) 

pk(rk.  0)  =/»*(!.  0).  pky(0,])=pk(0,l). 

pk(0.0)=\-pk{0J).  (15) 

Note  that  state  0  in  Fig.  4  corresponds  to  the  inactive 
state  0  (Fig.  2).  A  visit  to  state  r*  determines  a  time 
instant  at  which  the  active  source  will  undergo  a  state 
transition.  The  source  moves  from  state  r  to  state  0 
(becomes  inactive)  in  the  next  time  instant  with  prob¬ 
ability  />;Vr\ Oi  =  Pk{  1,0);  the  source  moves  from  state 
0  to  state  1  with  probability  /?*.(0. 1 )  =  />*(0. 1 ).  The 
source  remains  in  the  active  state  for  rk  time  units. 
Upon  visit  to  state  rk  packets  are  generated  according 
to  the  probabilities 

of(rA:mj  =  o*(/:w),  m  €  Z°.  (lb) 

No  packets  are  generated  from  the  other  states.  That  is. 

d}(/:0)  =  1.  0  <  /<  r\  (16) 

In  view  of  the  above  discussion,  the  packet  arrival 
process  {Bk}jeJ.  generated  by  the  A*th  double-time- 
constant  source,  1  <  k  <  A\  can  be  described  as  a 
MMGB  process  based  on  the  underlying  Markov  chain 
{Yj})€j  and  the  packet  generating  probabilities  given 
by  Eq.  (16). 

The  cumulative  packet  arrival  process  {Arj)j£js 
generated  by  the  arrival  processes  {!?*},  €  j,  \  <  k  <  N* 
can  be  described  as  a  MMGB  process  based  on  the 
A'-dimensional  Markov  chain  {X]  }y  €  j  =  {( Y} ,  -  *  • 
Y?)}j  €  j  with  state  space  ftf  =  x  ft;.  x  ft*. 


k  =  I 

and  packet  generating  probabilities 
o'(y\y\...ys:k)  =  \$ek(yk;-m,  k  e  Z°, 

k 

- — vA )  €  ft,.  (18) 

In  the  next  section  the  analysis  of  the  queueing  model 
formulated  above  is  presented. 

3.  The  queueing  system  under  MMGB  arrivals 
3.1.  The  general  case 

Consider  the  FCFS.  single  server,  discrete-time  queue¬ 
ing  systems  with  cumulative  packet  arrivals  described 
by  a  MMGB  process  {.4,}w;  J  denotes  the  system 
axis,  as  defined  in  the  previous  section.  Let 
denote  the  underlying  Markov  process  associated  with 
{Aj}J(J;  let  ft  denote  its  state  space  with  cardinality  ft; 
let  tt(a*)  and  p{x.y)  denote  its  steady  state  transition 
probabilities,  a*,  v e  ft.  Let  o(.v;  k)  denote  the  probabilistic 
mapping  which  determines  the  probability  that  k  packets 
arrive  when  {A';};,/  is  in  state  x,xe  ft,  k  e  Z°  and  k  <  Ry 
0  <  R  <  oc;  let  p(x)  and  cr(.v)  denote  the  first  and  the 
second  moments,  respectively,  of  the  probability  mass 
function  <t>[x:  •). 

Let  pm(j:y)  denote  the  joint  probability  that  there  are 
j  packets  in  the  (queueing)  system  at  the  mlh  time  instant 
(packet  arrivals  at  this  time  instant  arc  included)  and 
the  state  of  {A',}y,y  is  y;  the  packet  arrivals  generated 
due  to  the  visit  to  state  y  have  not  been  accounted  for. 
The  evolution  of  the  buffer  occupancy  process  can  be 
described  by  an  (ft  -f  1  )-dimensional  Markov  chain 

with  state  space  ft  x  {0.  1,2 _ }  and  state  probabilities 

described  by  the  equations: 

R 

pn,(j:y)  =  Y.Y.  1  -  A-:.v)/»(.v,  r)0(-v:  k). 

x*Uk  =  0 

j  >  R  ~  \  (19a) 

pmW- }')  =  52p(-v..v)52 [pm~'(k\.x)6(xj  +  1  -  k) 

jt<f!  *=i 

+  pm~  1  (0;  .v)o(.t;  j)].  0  <j<R.  (19b) 

Under  the  stability  condition  for  the  queueing  system, 
given  by 

A  =  y^/i(.v)~(.r)  <  1  (20) 

A  f  !  t 

the  (ft  +  1  {-dimensional  Markov  chain  described  in 


I  1  <  ' 


I  .V\, 


. .  *'  .r  (  C  on. 


/V  l*'  *f'  /  ’  /J  / 


(1) 

(2) 

(3) 

is! 


Eq.  (19)  is  ergodic.  By  considering  the  limiting  prob¬ 
abilities  as  m  —  oc  and  their  generating  function,  the 
following  equations  are  derived: 

/»(--:  r)  =  +  (-*  -  l)/»(0:.v)] 

*  =o.t<n 

x  p(\\y)o(x:k)  (21) 

p(0:  x)  denotes  the  boundary  probability  that  the  system 
is  empty  and  the  state  of  the  packet  arrival  process  is  .v; 
its  computation  is  discussed  in  Appendix  A.  P(z:y) 
denotes  the  probability  generating  function  of  p{j:y), 
0<j<  oc.  Note  that 

P(r)  =  £>(r;.v)  (22) 

I  <11 

where  P{z)  denotes  the  probability  generating  function 
of  the  buffer  occupancy  process.  Since  the  k[h  derivative 
of  P{z).  evaluated  at  r  =  1.  is  equal  to  the  klh  factorial 
moment  of  the  number  of  packets  in  the  system  [21], 
it  turns  out  that  the  A:th  moment  can  be  evaluated  by 
differentiating  Eq.  (21)  k  times,  setting  r  =  1  and  solving 
the  resulting  system  of  equations.  These  equations  are 
not  linearly  independent.  An  additional,  linearly  inde¬ 
pendent  equation  is  derived  by  following  a  procedure 
similar  to  that  shown  in  Appendix  B  (tor  the  first 
moment)  and  invoking  L'  Hospital's  rule.  By  differen¬ 
tiating  Eq.  (21)  and  setting  r  =  l  the  following  system 
of  H  linear  equations  is  obtained  with  respect  to  P'( l;.r), 
acH. 

P’(Uy)  =  £>(*.  »•)[./>'(  l;.v)  +  (/<(v)  -  1  )»(.*) 

<<t! 


+  p(  0;-v)],  vefi 


The  linearly  independent  equation  (Appendix  A)  is 
given  by 

Y  [2(/i(.v)  -  l)P'(l:.t)  +  2(p{x)  -  \)p(0:x) 

+  (2  +  a{x)  -  3ji(.r))ir(.t)]  =  0  (23b) 

The  mean  buffer  occupancy,  Q j  =  P'{  1),  and  the  mean 
packet  delay,  D  (by  using  Little's  theorem),  are  then 
obtained  from 

G, ,  =  IP].  Z>  =  |  (24) 

where  I  is  the  fi-dimensional  identity  row  vector.  P]  is 
the  transpose  of  the  row  vector  of  />,(l:.v).  .veft.  and  A 
is  the  mean  packet  arrival  rate  given  by  Eq.  (20). 

If  is  an  independent  and  identically  distributed 

process,  it  may  be  considered  to  be  a  MMGB  process 
based  on  an  underlying  Markov  chain  with  a  single  state, 
x0.  In  this  case  Q  =  {.v0},  rr(.v0)  =  1,  p(x0,x 0)  =  1. 
/i(.r0)  =  A  and  (7(.v0)  =  <r.  From  Eqs.  (23).  (24)  it  is  easily 
obtained  that 


Qf 


=  A  + 


a  -  A 

2[Too 


and 


DmJ 


=  1  + 


<t/A  -  1 
2(1  -  A)* 


(25) 


Eqs.  (23)  and  (24)  provide  for  the  mean  packet  delay 
induced  by  the  queueing  system  under  all  source  models 
considered  in  Section  2.  The  dimensionality  of  this 
system  of  equations  is  equal  to  the  cardinality  of  the 
underlying  Markov  chain  Under  single-time- 

constant  sources  and  asymmetry  (Section  2.1).  the  under¬ 
lying  Markov  chain  has  a  state  space  of  car¬ 

dinality  2VA/;  its  transition  probabilities  are  given  by 
Eq.  (9)  and  the  associated  packet  generating  probabil¬ 
ities  by  Eq.  (10).  Under  single-time-constant  sources 


(23a) 
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and  symmetry  (Section  2.2),  the  underlying  Markov 
chain  has  a  state  space  of  cardinality 

(A  -p  I  )r:  its  transition  probabilities  are  given  by 
Eqs.  (12).  (13)  and  the  associated  packet  generating 
probabilities  by  Eq.  (14).  Notice  that  the  number  of 
equations  which  need  to  be  solved  increases  exponen¬ 
tially  for  the  asymmetric  case’ and  linearly  for  the  sym¬ 
metric  case,  with  respect  to  the  number  of  sources  Ar. 
Under  double-time-constant  sources  (Section  2.3).  the 
underlying  Markov  chain  { X '}  has  a  state  space  of 
cardinality  ni  =  i (T*  +  transition  probabilities 

are  given  by  Eq.  (17)  and  the  associated  packet  generat¬ 
ing  probabilities  by  Eq.  (18).  When  the  double-time- 
constant  sources  are  identical  with  time-constant  r. 
the  number  of  linear  equations  which  need  to  be  solved 
can  be  significantly  smaller  than  =  (r+  l)v.  In 
Appendix  C  it  is  shown  that  the  number  of  equations 
in  this  case  is  given  by 

r  _  (A*  —  t ) ! 

~  yTTi  •  (26) 

The  reduced  set  of  linear  equations  and  a  discussion  on 
the  numerical  complexity  may  be  found  in  Appendix  C. 


Curve  (2)  presents  the  induced  packet  delay  when  ck , 
1  <  k  <  A\  are  not  identical  for  all  sources  but  they  are 
spread  nearly  uniformly  over  r.  When  r=  1,  no  such 
spreading  is  possible  and  ck  =  0,  1  <  k  <  A'.  In  this 
case  curves  (1)  and  (2)  coincide.  Notice  that  the  larger 
the  value  of  r.  the  larger  the  smoothing  of  the  arrival 
process  that  can  be  accomplished  through  the  spread¬ 
ing  of  ck .  The  intensity  of  the  queueing  problems  is 
decreased  as  r  increases,  as  a  result  of  the  distribution 
of  the  potential  packet  arrival  instants  associated  with 
the  various  sources.  The  cumulative  arrival  process 
IS  determined  as  described  in  Section  2.1.  In 
this  case.  2 ~Tr  linear  equations  arc  contained  in  Eq.  (23). 

Curve  (3)  presents  the  mean  packet  delay  results 
when  the  packets  are  delivered  by  each  of  the  A'  =  2 r 
sources  according  to  a  Bernoulli  process  with  rate 
~  (1  )/t  packets  per  time  unit.  The  delay  results,  in  this 
case,  are  obtained  from  Eq.  (25).  The  resulting  cumu¬ 
lative  packet  arrival  process  is  a  binomial  with  param¬ 
eters  2rand  zk(\)/r.  The  second  moment  of  this  process 
is  given  by 


o  —  A - 

v  '*(!)" 

T 

r 

T 

4.  Results  and  conclusions 

In  this  section,  numerical  results  are  presented  for 
the  mean  packet  delay  induced  by  a  statistical  multi¬ 
plexer  under  the  packet  arrival  processes  described  in 
Section  2. 

In  Fig.  >  some  numerical  results  are  presented,  as  a 
function  of  the  source  time-constant  r,  under  various 
packet  traffic  models.  The  packet  sources  are  assumed 
to  be  identical  with  parameters 

{-‘in  =  0.35.  *  *  =  0.93.  T.  ck  =  0.  o‘(l.  1)  =  1}. 

I  <  k  <  A. 

where  the  number  of  sources.  A',  is  set  to  be  equal  to  2 r. 
The  resulting  cumulative  packet  traffic  rate  is  equal  to  A’ 

( I ) / t  =  0.70  packets  lime  units.  Notice  these  packet 
sources  have  the  characteristics  of  packetizcd  voice 
sources  delivering  one  packet  every  r  time  units,  when 
active. 

Curve  ( 1 )  in  Fig.  5  presents  the  induced  packet  delay. 
Notice  that  for  the  same  total  traffic  rate,  the  induced 
packet  delav  decreases  as  the  lime-constant  increases. 
This  decrement  in  the  delay  is  attributed  to  the  reduction 
of  the  source  correlation  (-/  =  0.93)  as  seen  by  the  server 
(as  r  increases),  as  well  as  to  the  increased  independence 
in  the  cumulative  traffic  due  to  the  increased  number  of 
(mutually  independent)  contributing  sources  (A'  =  2 r). 
The  cumulative  arrival  process  { A f}i  f  j  is  determined 
as  described  in  Section  2.2.  In  this  case,  (2 r  +  1  )r  linear 
equations  are  contained  in  Eq.  (23). 


2-*(I) 


1  - 


+  [2-tO)]2. 


Notice  that  a  increases  as  r  increases,  and  thus  the  mean 
packet  delay  also  increases,  according  to  Eq.  (25).  This 
trend  is  clearly  observed  in  curve  (3)  of  Fig.  5.  Notice 
that  as  r  increases  the  difference  between  curves  (1)  (or 

(2) )  and  curve  (3)  decreases,  as  expected. 

Curve  (4)  presents  the  delay  results  under  the  traffic 
modeling  considered  for  curve  (1)  but  assuming  that 
7  =  0.  1  <  k  <  X.  That  is.  the  sources  are  assumed  to 
be  uncorrelated.  Notice  that  the  resulting  cumulative 
packet  arrival  process  is  different  from  the  binomial 
process  which  was  considered  for  the  derivation  of 
curve  (3).  For  =  0.  1  <  k  <  A’,  and  r>  1,  the  source 
can  generate  packets  only  at  time  instants  in  Tk  (sepa¬ 
rated  by  r  time  units)  and  not  at  any  time  instant  in  7, 
which  is  the  case  under  the  underlying  Bernoulli  model 
assumed  for  curve  (3).  Notice  that  the  delay  results  have 
smaller  values  in  this  case  compared  to  those  under  curve 

(3) ,  possibly  due  to  the  positive  cfTcct  of  the  constraint 

that  packets  cannot  be  generated  at  time  instants 
separated  by  less  than  r  lime  units  under  the  model  for 
curve  (4).  The  cumulative  arrival  process  {A^^j  is 
determined  as  described  in  Section  2.2.  In  this  case,  r 
linear  equations  are  contained  in  Eq.  (23),  since  the 
underlying  Markov  chain  becomes  one  dimen¬ 

sional  and  it  is  described  by 

Finally,  curve  (5)  presents  the  delay  results  under  the 
model  considered  for  curve  (4),  but  under  the  assump¬ 
tion  that  the  values  of  ck ,  1  <  k  <  Afi  are  ncar-uniformly 
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Fig.  6.  (a)  Mean  delay  results  for  the  ,V  =  3  packet  sources  under  various  traffic  models;  0  <  A  <  0.9;  (b)  mean  delay  results  for  the  .V  =  3  packet 
sources  under  various  traffic  models;  0.9  <  A  <  0.97. 

spread  over  r.  As  it  was  explained  before,  this  spreading 
of  the  potential  packet  arrival  instants  has  a  positive 
impact  on  the  queueing  intensity;  this  effect  is  clearly 
observed  by  comparing  curves  (4)  and  (5)  of  Fig.  5. 

The  cumulative  arrival  process  {/<,};  €y  is  determined 
as  described  in  Section  2.2.  In  this  case,  M  = 

LCM{r,r _ ,t}=t  linear  equations  are  contained 

in  Eq.  (23),  since  the  underlying  Markov  chain 
{Xj }j<zj  becomes  one  dimensional  and  it  is  described 
by  {A?,}>ey. 

Notice  that  when  r  =  1  the  arrival  processes  may 
change  state  at  any  slot.  In  this  case,  the  results 
may  also  be  obtained  from  the  closed  form  expression 


given  by  [4]: 


Fig.  6  presents  the  delay  results  obtained  by  multiplex¬ 
ing  N  =  3  packet  sources  with  different  time-constants 
given  by  r 1  =  2,  r2  =  4  and  r3  =  4.  The  rest  of  the  para¬ 
meters  are  assumed  to  be  identical.  The  source  of  bursti- 
ness  coefficient  is  equal  to  7 td  =  0.5  and  the  packet 
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generating  probabilities  are  given  by  1)  =  0.3, 

4>id{\.2)  =  0.4  and  1 . 3)  =  0.3.  The  probability  that 
the  packet  source  is  in  the  active  state,  ^(l),  is  selected 
so  that  the  resulting  cumulative  packet  traffic  rate  is 
equal  to  some  value  from  the  horizontal  axis  of  Fig.  6. 
Curves  (1).  (2),  (3),  (4)  and  (5)  show  the  mean  packet 
delay  under  the  traffic  modeling  considered  for  the  deri¬ 
vation  of  the  corresponding  curves  of  Fig.  5.  Curve  (6)  is 
derived  under  the  assumption  that  each  of  the  sources 
may  be  re-activated  at  any  time  instant  and  not  at  integer 
multiples  of  rk ,  1  <  k  <  3.  Thus,  the  double-time- 
constant  source  model,  discussed  in  Section  2.3,  has 
been  assumed.  n*  =  i(r*  +  1)  linear  equations  are  con¬ 
tained  in  Eq.  (23),  in  this  case.  By  exploiting  the  partial 
symmetry  ( S 2  =  Sy)  and  properly  formulating  the  equa¬ 
tions  in  Eq.  (23)  (see  Appendix  C)  a  reduced  number  of 
linear  equations  will  need  to  be  solved.  Curve  (7)  presents 
results  under  the  source  modeling  considered  for  curve 
(6)  under  the  assumption  that  *)k  =  0,  1  <  k  <  3.  Notice 
that  slates  0  and  1  may  be  merged  in  this  case  (Fig.  4), 
resulting  in  a  reduced  number  of  linear  equa¬ 

tions  contained  in  Eq.  (23).  Further  reduction  may  be 
achieved  by  exploiting  the  partial  symmetry  in  the  arrival 
process,  as  explained  before. 

Notice  that  the  coincidence  of  curves  (1)  and  (2)  and 
curves  (4)  and  (5)  is  due  to  the  specific  parameters  of  the 
selected  example.  Finally,  notice  that  curve  (6)  is  always 
above  curves  (1)  and  (2)  derived  for  the  same  values  of 
1  <  k  <  3.  Similarly,  curve  (7)  is  always  above  curves 
(4)  and  (5)  derived  for  *)k  =  0,  1  <  k  <  3.  This  behavior 
may  be  explained  in  view  of  the  fact  that  the  minimum 
separation  between  two  consecutive  active  periods  in  the 
traffic  for  the  curves  (1),  (2),  (4)  and  (5)  is  equal  to  r\ 
1  <  k  <  3,  while  that  under  the  traffic  models  for  the 
curves  (6)  and  (7)  is  equal  to  one.  The  latter  is  expected 
to  have  negative  effect  on  the  intensity  of  the  resulting 
queueing  problems.  Finally,  the  increased  delay  results 
presented  under  curve  (3)  may  be  explained  in  view  of  the 
fact  that  there  is  no  minimum  separation  between  con¬ 
secutive  packet  arrivals,  in  any  case,  for  the  traffic  model 
for  curve  (3). 
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Appendix  A 

The  boundary  probabilities  p( 0;.v)  are  obtained  as 
follows.  When  there  is  at  most  one  stage  ,v0  in  Q  such 


that  d>(.v0: 0)  >  0  (that  is,  ,v0  is  the  only  state  which  may 
not  deliver  a  packet  to  the  system),  then  it  is  easily  estab¬ 
lished  that 

P(0;.y)  =  q$p{xQ.x),  (A1) 

where  q0  =  1  -  A  is  the  probability  that  the  system  is 
empty.  Notice  that  x0  is  the  only  state  at  which  {A )}j<j 
could  have  been  in  the  previous  slot,  so  that  the  system 
is  currently  empty.  When  the  previous  condition  is  not 
satisfied,  the  boundary  probabilities  may  be  obtained 
through  the  following  procedure  which  may  be  found 
in  Ref.  [10]: 

p( 0;.v)  =  p(0:/)  =  f0n»(i)  for  .Ye f)  or  /c{  1,2 . fi}, 

(A. 2) 

where  /,  1  <  /  <  fi,  denotes  an  ordering  of  the  states  in 
M0*  1  <  /  <  f),  are  computed  from 

nK#=  i,  nK[/’j  =  fiH  (a. 3)  . 

where  fi,  =  [ttm(1  ),..., 7r^ (Q)]  and  the  matrix  [/*„]  is 
computed  from 

i/>.]  =  E(C, )[/>)[/>.]*.  (A.4) 

*  =  o 

where  [P]  is  the  transition  probability  matrix  of  the 
Markov  chain  {Xj}jtj  and  [G*]  is  an  Cl  x  Cl  diagonal 

matrix,  [Gk]  =  diag[<p(l;A'), tf>(2; k) . <^(fi;A)],  0  <  k 

<  R:  R  is  the  maximum  number  of  packet  arrivals  over 
a  slot;  {<J(/: *)}"=i  =  {<?(*:*)}„!)•  0  <  k  <  R. 

Appendix  B 

The  linearly  independent  equation  shown  at  Eq.  (23b) 
is  derived  by  differentiating  Eq.  (21);  adding  up  the 
resulting  equations  and  using  the  derivative  of  Eq.  (22): 

*  =  O.tffi  L 

x  [P{::.\)  +  (r  -  l)/»(0,.v)]  +  :k~' 
x  (/>'(;;  .v)+/j(0;.v)]] 

By  adding 

ZP'(z)  ]T  P'(:\ x)<t>(x;k)  =  0 

k  =  0  x  til 

to  the  right-hand  side  of  the  previous  equation,  the 
following  expression  is  obtained 

?'(:)  =  fzz  (B.  1 ) 


I 


/  V. 


I'* 


m:  n:.i 


where 


■<(--)  =  Z  53  0(.v:  *)[(*- 


A---S 0  v*ii 


+f-  -  1  MO:  .V)]  +  =*  - 1  [/>'(--:  a)  +  p(0:  a)] 
a-)1. 


Since  /*'(  l)  is  equal  to  the  mean  bufTer  occupancy,  which 
is  finite  if  Eq.  (20)  holds,  and  since  1  -  r  =  0  for  r  =  I. 
Pr(  1  j  is  computed  from  Eq.  (B.l)  by  using  L*  Hospital's 
rule.  Thus, 


P'[ 


i 

i 

-i| 

t:  -  I 


dA(z)/d: 
d(\-z)!J:: a| 


-d Aii)  c/j 


i 


i  B2) 


From  Eq.  (B.2)  and  the  derivative  of  Eq.  (22)  evaluated 
at  r  =  I,  Eq.  (23b)  is  easily  derived. 


Appendix  C 

Let  r  -  1  be  the  cardinality  of  the  state  space  of  the 
(assumed  identical)  underlying  Markov  chains  {Ykf}ftj 

(see  Section  2.3).  Let.v  =  (.V|..y: . ( j*  denote 

a  state  of  the  underlying  N  dimensional  Markov  chain 
{ .Vy  \  f  *  j '  Since  the  packet  arrival  processes  {Bk}jtJ  are 
identical,  the  resulting  cumulative  arrival  process 
{A]}jtJ  will  assume  (probabilistically)  the  same  values 
under  states  .v  €  which  correspond  to  the  same 
number  of  sources  in  each  state;  for  instance  states 
('W: . *'r-i)  and  (/:,/, . Ir  + ,). 

If  f-(.v)  =  (c,(.v),r:(.r)t...iv*i(.v))  is  a  (r+1)- 
dimensional  vector  with  (.v),  /  =  1,2 _ r  +  1,  denot¬ 

ing  the  number  of  input  Markov  chains  at  state  /, 
then  each  such  vector  v(x)  with  the  constraint 
YlJ=\vi(x)  =  represents  a  class  of  equivalent  states 
v.  The  number  of  equivalent  states  x  in  a  class  t?(.v)  is 
given  by  the  multinomial  coefficient  [22] 

iV 

iV! 

^f1(.v)!uI(.x)!...tv+,(.v)!’ 

where  vt  (.v )  is  the  number  of  input  processes  in  state  /, 
as  determined  by  .r,  /  =  1,2 . r+1.  This  is  basi¬ 

cally  the  number  of  ways  in  which  a  population  of  Af 
elements  can  be  divided  into  r+  1  ordered  parts  of 
which  the  first  contains  U|(.v)  elements,  the  second  con¬ 
tains  t’:(.v)  elements  and  so  forth.  Let  F  be  the  set  of 
representative  states  x  of  the  symmetric  system  (i.e.  no 
two  states  xeF  belong  to  the  same  class  of  equivalent 
states).  Let  i’(.y0)  be  the  class  of  the  equivalent  to  .v0 
states.  For  each  .v0,  y0 1 F,  Eqs.  (23a)  and  (23b)  can  be 
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written  as  follows: 

P'(i.h)  =  Zj 

,  Z  rt*'.ro)| 

/’'(I;- v) 

t0  f  F  \ 

[  v*  el  To!  J 

1 

+  Z  o)  [(/»(•'■)  -  n-(A-)  +P{ 0:.v)]. 
x<  ir 


jo  £  F  (C. 2a) 

^c(.v0)2(/t(.v0)-l  )/>'(!:. v0) 

To*  F 

+  j:{2(n{X)-l)p[0:X) 

T<  iV 

+  [2  +  a-3M.v)k(.t)}  =0.  (C.2b) 

# 

where  c(.v0)  is  given  by  (C.l).  The  mean  buffer  occu¬ 
pancy  can  be  obtained  from 

Q,  =  P’(\)=  Y,  P'(\-..x0)c(.x0).  (C.3) 

x0*  F 

The  induced  average  packet  delay  is  then  obtained  from 
Eq.  (24).  The  reduced  number  of  equations  in  Eq.  (C.2) 
are  then  easily  seen  to  be  given  by  Eq.  (26),  which 
expresses  the  number  of  ways  of  partitioning  /V  things 
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into  r  +  1  groups  [22].  Notice  that  significant  reduction 
of  the  number  of  required  equations  may  be  achieved 
under  symmetry.  Comparison  of  ASym  (see  Eq.  (26)) 
and  A  jsym  =  (r+  1)A  for  some  pairs  (:V.r)  which  result 
in  less  than  1000  linear  equations  in  Eq.  (C.2),  are  shown 
in  Table  1. 
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Abstract 

Networks  are  expected  to  support  applications  with  diverse  traffic  characteristics  and  Quality  of  Service  (QoS) 
requirements,  such  as  voice,  data,  video  and  multi-media.  In  this  environment  it  is  important  to  determine  whether  a  given 
QoS  vector  is  achievable  (for  call  admission  control),  and  if  it  is,  design  efficient  resource  allocation  policies  that  deliver 

it.  . 

In  this  paper  a  system  of  T  resources  (wireless  channels)  shared  by  a  number  of  un-buffered  applications  is  considered. 
Packets  which  are  not  allocated  a  channel  within  the  current  assignment  cycle  are  dropped.  The  QoS  vector  is  equivalently 
described  in  terms  of  (diverse)  packet  dropping  probabilities  ( p, ),  or  (diverse)  packet  dropping  rates  (dt).  The  region  of 
achievable  QoS  vectors  is  precisely  determined  and  easy  to  implement  scheduling  policies  that  deliver  the  achievable  QoS 
vectors  are  derived.  Some  applications  to  call  admission  control  and  resource  management  are  presented  along  with  some 
numerical  examples. 

Keywords:  Integrated  services;  Wireless  networks;  Achievable  QoS;  Scheduling 


1.  Introduction 

The  problem  of  allocating  shared  resources  is  an  interesting  and  continuously  revisited  one.  The 
primary  resources  to  be  shared  in  a  networking  environment  are  memory  and  transmission  bandwidth. 
As  technology  advances  and  resources  increase  their  capabilities,  existing  services  are  enhanced  and 
new  services  are  provided,  driving  the  need  to  develop  resource  allocation  technologies  that  integrate 
diverse  services. 

Networking  resources  are  typically  managed  at  two  largely  different  time  scales,  determined  by  the 
service  request  inter-arrival  and  duration  time  scale  and  a  smaller  time  scale  over  which  certain  amount 
of  resources  need  to  be  allocated.  Management  of  resources  at  the  larger  time  scale  is  the  responsibility 
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of  the  call  admission  controller.  Management  of  resources  at  the  smaller  time  scale  is  the  task  of  the 
buffer/bandwidth  managers. 

Typically,  both  call  admission  and  buffer/bandwidth  management  functions  need  to  be  employed 
to  ensure  (sometimes  in  a  probabilistic  sense)  the  delivery  of  a  target  Quality  of  Service  (QoS).  The 
more  stringent  and  diverse  the  target  QoS  of  the  supported  applications,  the  more  critical  is  the  exercise 
of  the  aforementioned  management  and  the  more  sophisticated  their  functions  become.  Typically,  the 
more  stringent  the  target  QoS,  the  greater  the  importance  and  need  for  admission  management.  This 
is  the  case  in  the  telephone  network  supporting  voice  services  with  stringent  QoS  requirements.  In 
traditional  computer  data  networks  where  the  QoS  is  not  considered  to  be  stringent,  the  role  of  call 
admission  management  is  minimal  (if  any).  While  “uniform"  allocation  of  buffer/bandwidth  capacities 
would  result  in  delivering  a  target  QoS  -  provided  that  it  is  achievable  -  which  is  common  to  identical 
supported  applications,  such  allocation  will  be  ineffective  when  diverse  QoS  is  to  be  delivered  to 
applications  with  diverse  traffic  characteristics.  Sophisticated  buffer/bandwidth  management  schemes 
are  needed  to  provide  for  the  desired  diversification,  such  as  space  priority  for  buffer  management  and 
priority  scheduling  for  bandwidth  allocation  [3]. 

The  work  presented  in  this  paper  investigates  the  above  mentioned  problem  of  resource  management 
at  both  time  scales  when  the  supported  applications  are  diverse  in  terms  of  both  the  required  QoS  and 
traffic  characteristics.  In  addition,  the  QoS  requirement  is  assumed  not  to  allow  for  packet  (information 
unit)  buffering  beyond  the  present  service  cycle  and,  thus,  buffer  management  is  not  considered  to  be 
an  issue.  Packets  that  do  not  receive  service  over  the  cycle  following  their  arrival  time  are  considered 
to  have  excess  delay  and  are  dropped.  Applications  with  such  QoS  requirements  are  considered  to  be 
un-buffered  and,  thus,  the  problem  is  concentrated  on  the  management  of  the  available  transmission 
resources  at  both  time  scales. 

A  wireless  networking  environment  supporting  real-time  applications  has  characteristics  of  a  shared 
un-buffered  environment  in  which  the  management  of  the  transmission  resource  is  most  critical  due  to 
its  scarcity.  Since  the  associated  QoS  requirements  and  the  mobility  of  the  supported  users  do  not  seem 
to  allow  for  buffer  management  schemes,  the  work  in  this  paper  is  presented  in  the  context  of  wireless 
networking  to  provide  for  some  reasonable  motivation.  The  investigated  problem  of  the  achievable  QoS 
and  the  associated  scheduling  schemes  do  maintain  their  general  appeal  throughout  the  paper. 

Two  major  wireless  networking  technologies  have  recently  emerged:  cellular  systems  and  wireless 
Local  Area  Networks  (LAN).  Cellular  systems  -  originally  designed  to  support  voice  applications  - 
are  currently  facing  the  challenge  of  accommodating  data  and  video  services.  Similarly,  wireless  LANs 
-  developed  to  support  computer  data  applications  -  are  currently  being  redesigned  to  be  capable  of 
providing  service  to  image,  video  and  possibly  voice  applications.  The  convergence  of  the  services 
supported  by  these  two  technologies  drives  a  demand  for  a  single  network  architecture  capable  of 
supporting  all  aforementioned  services  in  an  integrated  fashion.  In  addition,  this  new  architecture  should 
be  compatible  with  the  prevalent  network  architecture  for  integrated  services  over  fiber/copper  based 
channels,  the  Asynchronous  Transfer  Mode  (ATM).  The  development  of  an  integrated  services  wireless 
network  technology  generates  new  transmission  resource  allocation  problems  that  are  more  challenging 
than  in  the  “specialized  services”  cellular  voice  systems  or  wireless  LANs. 

Consider  for  instance  the  transmission  resource  management  problem  in  a  cellular  (voice)  system. 
Since  the  supported  (voice)  applications  are  identical  -  in  terms  of  traffic  characteristics  and  QoS 
requirements  -  the  transmission  resource  management  problem  at  the  large  time  scale  is  relatively  easily 
addressed.  Given  a  certain  amount  of  available  resources,  the  maximum  number  of  voice  applications 
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that  can  be  supported  is  such  that  the  induced  ( common )  system  packet  dropping  probability,  ps,  be  less 
than  a  threshold,  typically  set  at  ps  <  0.01  [5].  Equivalently,  this  number  may  be  determined  as  the  one 
inducing  a  system  packet  dropping  rate,  ds  <  (0.01)(A*);  ds(\s)  is  defined  to  be  the  average  number  of 
nackets  dropped  (arrived)  per  service  cycle.  This  simple  rule  also  addresses  the  problem  of  determining 
the  number  of  required  transmission  resources  in  order  to  support  (simultaneously)  a  target  number 
of  applications.  The  uniformity  in  the  QoS  requirements  and  traffic  characteristics  of  the  supported 
voice  applications  simplifies  substantially  the  transmission  resource  problem  at  the  smaller  time  scale  as 
well.  A  uniform,  non-discriminating  scheduling  policy  would  induce  the  same  (system)  packet  dropping 
probability  to  all  supported  applications. 

The  above  discussion  on  resource  management  for  cellular  voice  systems  can  be  extended  to  th 
for  wireless  LANs.  The  QoS  requirements  of  the  supported  computer  applications  suggest  that  resource 
management  at  the  smaller  time  scale  is  of  primary  importance.  Again,  due  to  the  uniformity  of  the  QoS 
requirements,  the  resource  management  at  the  smaller  time  scale  is  not  nearly  as  challenging  as  that  in  a 

intearated  services  network.  .  .  .  . 

In  this  paper,  the  shared  transmission  resources  are  defined  to  be  the  slots  (packet  transmission  times) 

of  a  TDMA  frame.  This  resource  structure  has  been  widely  considered  in  both  cellular  systems  [7]  and 
wireless  LANs  [1],  as  well  as  in  recent  work  toward  the  development  of  wireless  ATM  networks  [10.6]. 
Variable  Bit  Rate  (VBR)  applications  with  distinct  QoS  requirements  and  traffic  characteristics  are 
considered.  The  region  of  achievable  multi-dimensional  (non-degenerate)  QoS  is  precisely  determined 
and  simple  scheduling  policies  that  deliver  the  achievable  QoS  are  developed. 

In  the  next  section,  the  system  model  considered  in  this  work  is  described.  A  set  of  equality  and 
inequality  constraints  are  presented  in  Section  3.1  that  are  employed  in  the  establishment  of  the  region 
of  the  achievable  QoS  vectors  in  Section  3.2.  The  derivations  of  Section  3  are  employed  in  source  call 
admission  control  and  resource  allocation  applications  and  presented  in  Section  4.  The  special  case  ot 
a  homogeneous  system  is  also  presented  in  Section  4.  Based  on  the  developments  of  Section  3  a  class 
of  simple  scheduling  policies  that  achieve  any  achievable  QoS  vector  is  developed  in  Section  5.  ome 
simple  numerical  examples  are  presented  in  Section  6.  followed  by  conclusion  of  this  work  in  Section  7. 

2.  Description  of  the  system  model 

In  this  paper,  the  problem  of  sharing  T  resources  by  N  diverse  Variable  Bit  Rate  (VBR)  source 
applications  is  considered.  The  source  packet  arrival  process  is  descnbed  in  terms  of  a  general  amval 
process  embedded  at  the  boundaries  of  fixed  length  intervals  called  service  cycles  (or  frames).  No 
additional  assumptions  for  the  packet  arrival  process  are  necessary  at  this  point.  Up  to  T  packets  may 
be  transmitted  (served)  during  each  service  cycle.  Packets  which  cannot  be  transmitted  over  the  service 
cycle  following  their  arrival  are  considered  to  have  excess  delay  and  are  dropped  at  the  source. 

The  above  environment  could  model  the  sharing  of  the  up-link  TDMA  frame  of  a  wireless  network 
by  N  applications;  T  slots  of  this  frame  are  available  to  these  applications.  Figure  1  illustrates  a  typical 
up-link  TDMA  frame  whose  slots  are  allocated  to  various  classes  of  services,  such  as  Continuous  Bit 
Rate  (CBR),  Available  Bit  Rate  (ABR)  and  Variable  Bit  Rate  (VBR).  In  this  figure  T  —  3  slots  are 
assumed  to  be  available  to  the  VBR  applications.  Let  M#i).  0<i  <  N,  denote  the  number  of  source  i 
packets  generated  at  the  nth  frame  boundary  and  requiring  service  over  this  frame.  If  L«=i  k"n) 
framen  is  said  to  be  underloaded-,  it  is  said  to  be  overloaded  if  hi")  >  T.Leta,  (n),  0  <  i  <  . 

denote  the  number  of  source  i  packets  serviced  during  the  nth  frame  under  some  scheduling  po  icy  J. 
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Request  Channel 


C  Assigned  CBR  slot 

V  Slot  available  for  VBR  traffic 

A  Slot  available  for  ABR  traffic 

Fig.  I.  TDM  A  frame  supporting  CBR,  VBR  and  ABR  traffic  classes. 


The  number  of  source  i  packets  dropped  during  frame  n  under  policy  /  is  given  by  dt  ( n ),  where. 


df  ( n )  =  A .,-(/*)  -  af  ( n ) 


=  0  < 
>  0  if  A.,(/i)  >  T 


(1) 


Let  df  =  E[df(n)],  af  =  E[af  (/i)]  and  X,  =  £[A.,(n)]  be  the  (assumed  time  invariant)  expected 
values  of  the  associated  quantities. 

Suppose  that  the  QoS  requirement  of  application  i  is  defined  in  terms  of  a  maximum  tolerable 
per  frame  packet  dropping  rate  dit  0  <  /  <  N.  l^en  the  QoS  vector  associated  with  the  supported 
applications  can  be  defined  in  terms  of  the  (performance)  packet  dropping  rate  vector  d. 


d  =  {dud, . dN). 


(2) 


When  the  QoS  requirement  of  application  i  is  defined  in  terms  of  a  maximum  tolerable  packet  dropping 
probability  pt,  the  corresponding  packet  dropping  rate  d,  is  easily  determined  by  </,  =  A.,  pt. 

The  first  question  addressed  in  the  sequel  (Section  3)  is  whether  a  given  QoS  vectors  d  is  achievable 
under  any  policy  /.  Necessary  and  sufficient  conditions  are  derived  in  order  for  the  QoS  vector  to  be 
achievable,  leading  to  the  precise  determination  of  the  region  of  achievable  QoS  vectors  d.  The  second 
question,  addressed  in  Section  4,  is  concerned  with  the  design  of  scheduling  policies  which  deliver  an 
achievable  QoS  vector  d. 


3.  Determination  of  the  region  of  achievable  QoS  vectors 

In  Section  3.1,  a  set  of  equality  and  inequality  constraints  associated  with  the  packet  dropping  rate 
vector  d/  induced  by  scheduling  policies  /  are  presented.  These  constraints  lead  to  the  derivation  of  the 
region  of  the  achievable  QoS  vectors,  presented  in  Section  3.2. 
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3.1.  Conservation  law  and  inequality  constraints 


Definition  3.1.  A  scheduling  policy  /,  is  work-conserving  iff  it  satisfies  the  following  conditions: 


1  =  1 

N 

when  ^X,(n)  <  T 

(underloaded  frame), 

(3) 

Y\o{(n)  =  T 

1  =  1 

N 

when  (n)  >  T 

i=i 

(overloaded  frame). 

(4) 

A  work-conserving  policy  does  not  waste  resources  (slots)  as  long  as  there  is  work  to  perform 

(packets  to  transmit).  Let  S  =  { 1. 2 . N)  be  the  set  of  all  sources.  Let  d{S)  denote  the  average  system 

packet  dropping  rate  under  scheduling  policy/,  given  by, 


Af  —  £ 
“{5|  -  C 


£x<n)  «£ £[<//(»>]  =  x> 

i=|  I  i=!  <  =  • 


The  following  theorem  provides  for  a  well  know  result  for  work-conserving  policies  [4]. 


(5) 


Theorem  3.1  (Conservation  Law).  The  system  dropping  rate,  d(S],  is  conserved  under  any  work- 
conserving  policy  f ;  that  is. 


4.  = 


X>,(n)  EWn)>r 


ieS 


ieS 


-T  P  fe>(-)  >  TJ  =bis | 


v/. 


(6) 


Notice  that  the  system  dropping  rate  is  independent  of  the  policy  f  and  only  depends  on  the  arrival 
process  and  the  amount  of  resources  T . 

Proof.  The  number  of  source  /  packets  dropped  during  frame  n  under  policy  /  is  given  by, 

df  (n)  =  Mn)  “  a! («)•  (7) 

Summing  (7)  over  all  sources  i  in  the  system  S  yields, 

T d'{n)  =  T  Ui(n)  -  a{ (n))  =  £><(«)  -  («)•  (8) 

its  *s  *s 

Assuming  /  is  work-conserving  and  using  Eqs.  (3)  and  (4)  of  Definition  3.1,  the  previous  equation 
becomes. 

0  £(€SMn)  <  T 

ErtSMn)>r. 

By  considering  the  expected  value  of  (9)  the  proof  of  the  theorem  is  completed. 

Let  d{g]  denote  the  average  subsystem  g  packet  dropping  rate  under  policy  /,  defined  by. 


ie5 


(9) 

□ 


dL\  - E 


£  </'<">  =  £  £[<(/(«)]  =  S9S  =  {i.2 

teg  J  ‘*8 


N). 


(10) 
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That  is,  d^  is  equal  to  the  cumulative  packet  dropping  rate  associated  with  sources  in  group  g  only, 
under  policy  /;  all  N  sources  in  5  are  assumed  to  be  present  and  served  under  policy  /. 

Let  b[g )  denote  the  lower  bound  for  the  cumulative  packet  dropping  rate  for  sources  in  g,  determined 
to  be  equal  to  the  packet  dropping  rate  of  a  system  in  which  only  sources  in  g  are  present  and  served 
under  a  work-conserving  policy.  Sources  in  set  {5  —  g)  are  considered  to  be  removed.  In  view  of 
Theorem  3. 1  and  by  applying  Eq.  (6)  for  the  subsystem  g,  the  following  is  derived. 


big)  = 


E 


*»•(«)  I  yM") >  t 

_ieg  itg 


Pr 


(ID 


The  following  theorem  provides  for  a  set  of  inequality  constraints  associated  with  scheduling  policies. 

Theorem  3.2.  The  following  constraints  associated  with  the  induced  packet  dropping  rate  vector  A  f  = 
(d{ ,  d{ . d{,)  are  satisfied  by  any  scheduling  policy: 


d(g]>b[gh  Vg  C  5  =  {1, 2 . N),  (12) 

where  d{g]  and  b^i  are  given  in  (10)  and  (11). 

Proof.  Suppose  that  the  packets  from  sources  in  g  are  served  under  a  work-conserving  scheduling  policy 
f0  in  which  they  are  given  Head  of  the  Line  (HoL)  priority  over  packets  from  sources  in  {5  -  g).  That 
is,  ho  packet  from  sources  in  (5  —  g}  is  served  unless  no  packets  from  sources  in  g  are  present.  As 
a  consequence,  packets  from  sources  in  g  do  not  experience  interference  from  the  traffic  delivered  by 
sources  in  {5  —  g}  and  thus, 

'Y^df‘(n)  =  0  when  ^A.,(n)  <  T,  (13) 

ieg  teg 

y^aj,’(n)  =  T  when  ^X,(n)  >  T.  (14) 

/e*  ieg 


From  ( 1 0)  and  ( 1 1 )  it  is  clear  that, 

for  all  work-conserving  policies  that  provide  HoL  priority  to  the  packets  from  sources  in  g.  Since  no 
other  policy  /  can  provide  better  service  (that  is  lower  packet  dropping  rates)  to  sources  in  g  than  f0,  it 
is  evident  that. 


d(s)  >  bw.  V/.  g  C  S.  □ 


(16) 


3.2.  Achievable  region  for  QoS  vectors 

The  main  result  of  this  section  is  the  determination  of  the  region  V  of  the  achievable  QoS  vectors 
d.  The  following  corollary  provides  a  set  of  necessary  conditions  in  order  for  a  QoS  vector  d  to  be 
achievable,  followed  by  a  corollary  regarding  an  upper  bound  on  the  achievable  region  for  QoS  vectors 
d.  Their  proofs  are  self-evident  in  view  of  Theorems  3.1  and  3.2. 
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Corollary  3.1.  A  necessary  condition  in  order  for  a  QoS  vector  d  -  (du  d2, ....  dN)  to  be  achieved  by 
a  work-conserving  scheduling  policy  f  is  that  its  components  satisfy  the  following  constraints. 

<iw>bul  Vic s.  Jj’j 


Corollary  3.2.  Let  Vu  denote  the  collection  of  all  vectors  d  satisfying  ( 1 7)  and  (18).  Then  Vu  is  an  upper 
bound  on  the  region  V  of  achievable  QoS  vectors  d.  That  is, 

pep".  (19) 


The  following  theorem  describes  the  vectors  contained  in  Vu. 

Theorem  3.3.  Anv  vecror  in  the  set  V “  can  be  expressed  as  a  convex  combination  of 

(vertices)  ofVu;  that  is,  Vu  maybe  expressedas  the  convex  hull  of  its  extreme  points.  V  =con  v(ext(V  )}. 

Proof.  The  proof  follows  from  the  fact  that  Vu  is  a  bounded  set  defined  by  a  finite  ^^sec“on  of  ^s^d 
half  spaces,  (see  (17)  and  (18)).  Then  by  definition,  Vu  is  a  polytope  (9]  and  Theorem  3.3  f  ^ 

directly  from  properties  of  polytopes  [9]. 

The  following  theorem  establishes  a  relationship  between  scheduling  policies  and  the  vertices  of  Vu . 

Theorem  3.4.  d*  is  a  vertex  of  the  set  Vu  iffd'  is  a  dropping  rate  vector  resulting  from  an  Ordered  HoL 

( O-HoL )  prioritv  service  policy.  tt=  (n\,  n2 . n n):  €  {1.2 .  }.  tr,  -  *•  j  -  ' 

The  index of  m  i indicates  the  order  of  the  priority  given  to  the  n.  source.  None  of  the  nj  sources,  j  >  i. 
may  be  served  as  long  as  packets  from  sources  nk,  k  <  i,  are  present. 

Proof.  Assume  that  d*  is  a  vertex  of  V.  Then  d*  must  lie  at  the  intersection  of  N  hyper-planes  and  its 
coordinates  must  satisfy  N  simultaneous,  linear  independent  equations  (definition  of  a  polytope  vertex. 

see  [9]).  given  by, 


£<*  =*«,.•  j  =  1.2. 


where  one  of  the  gf  s  is  the  set  S  =  {1. 2 . N)  and  the  remaining  (N  -  D  are  proper,  non-empty  and 

different  subsets  of  5.  Lemma  A.2  (see  appendix)  establishes  that  the  subsets  g,  s  are  stnctly  included  in 
each  other.  Therefore,  by  adopting  the  order  g,  C  gi  C  •  •  •  C  gs-i  C  gs  =  the  g,  s  are  given  oy, 

.  ,  (21) 

gi  =  (*il 

g\  c  g2  = 

g2  C  g3  =  {^1.^2.  ^Tj) 


gN-l  C  gN  ~  {JTi.^2 . **)  =  I1-2 . W' 
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Dropping  rate,  source  1 

Fig.  2.  The  region  (polytope)  V“  for  a  system  with  two  sources. 


where  7,-  6  {1,2 . N],  Jtj  ^  jij,  for  /  #  y.  1  <  i,  j  <  N.  Substituting  into  (20)  yields, 

^(jT||  =  ^(fi) 

^(.T;|  =  ^|f|..T;l  ~  i’l.T,) 

•:  (22) 

^(V*l  =  . awl  —  ®|ai,a: . a.v-il» 

which  is  precisely  the  dropping  rate  vector  induced  by  the  ordered  HoL  priority  policy  7r= 
(.ti.7 ri,. . .  ,7T/v).  Thus  for  any  vertex  there  exists  an  ordered  HoL  priority  policy  that  induces  it. 
Since  it  is  easy  to  see  that  the  dropping  rate  vector  resulting  from  an  ordered  HoL  priority  policy  must 
satisfy  (22),  it  is  evident  that  these  dropping  rate  vectors  will  be  vertices  of  Vu .  □ 

Figures  2  and  3  provide  a  graphical  illustration  of  the  region  Vu  for  the  case  of  N  =  2  and  N  =  3 
sources,  respectively.  The  extreme  points  d„f_,’s  correspond  to  QoS  vectors  induced  by  the  N\  ordered 

HoL  priority  policies  7r*=  (ttj  .  tti . 7T/y),  1  </  <  N\,  as  shown  in  Theorem  3.4.  Referring  to  Fig. 

2,  it  may  be  observed  that  the  policy  (jri,  jti)  =  (1,2)  corresponds  to  the  intersection  of  the  line  for  the 
lower  bound  on  the  packet  dropping  rate  line  for  source  1,  b\\\,  with  the  system  dropping  rate  line  b\ i.;|. 
Similarly,  the  second  extreme  point  induced  by  the  policy  (71,72)  =  (2,  1)  is  the  intersection  of  the 
lower  bound  on  the  packet  dropping  rate  for  source  2,  bp\  and  £|i.2|-  Similar  observations  can  be  made 
for  the  region  Vu  for  a  system  of  N  =  3  sources  shown  in  Fig.  3. 

Let  V,  V  C  Vu ,  be  the  achievable  region  of  QoS  vectors  d.  The  following  theorem  establishes  its 
convexity. 

Theorem  3.5.  Let  d| ,  di  €  V,  then  dj,  where 
dy  =  otd|  +  (1  —  oOdi,  a  >  0,  a  <  1, 
is  also  in  V.  That  is  V  is  convex. 


(23) 
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v 


Fig.  3.  The  region  (polytope)  V“  for  a  system  with  three  sources. 


(24) 

□ 


Proof.  Let  d,  (d;)  be  the  dropping  rate  vector  induced  the  achievable  policy  5,  (S:):  that  is.  di .  d2  €  V. 
Consider  a  scheduling  policy  S3  that  at  each  frame  decides  to  follow  the  scheduling  rule  of  po  icy  , 
with  probability  a  and  policy  5,  with  probability  (1  -  a);  the  decisions  over  consecutive  frames  are 
independent.  The  packet  dropping  rate  induced  by  policy  S3  is  given  by, 

d3  =  ad,  4*  (1  -  a)d3. 

Since  d3  is  achieved  by  a  policy,  d3  €  P,  establishing  the  convexity  of  P. 

The  next  theorem  establishes  the  region  P  of  achievable  QoS  vectors. 

Theorem  3,6.  P  =  P“. 

Proof.  Since  PCD“  (Corollary  3.2),  it  suffices  to  establish  that  Vu  C  V  to  complete  the  proof.  Notice 
that  Vu  =  conv[e.tr(Pu)]  (Theorem  3.3)  and  that  V  is  convex  (Theorem  3.5)  and  thus,  if  [ex:(U  )) 
c  V  then  Vu  C  V  .  The  latter  holds  true  since  the  extreme  points  of  Vu  are  induced  by  the  ordered  HoL 
priority  policies  (Theorem  3.4)  and.  thus,  these  points  are  in  the  region  V  of  achievable  QoS  vectors.  U 


4.  Applications 

In  the  proceeding  sections  a  system  of  N  sources  sharing  T  resources  has  been  considered  and  the 
re o ion  V  of  achievable  QoS  vectors  d  induced  by  any  work-conserving  policy  has  been  estabhshe  . 
If°the  amount  of  available  resources  T  is  fixed,  it  is  important  to  determine  whether  all  N  diverse 
applications  can  be  supported.  If  the  required  resources  can  be  made  available,  it  is  <mP°rtant  ^ 
determine  the  minimum  number  of  required  resources.  These  two  questions  are  addressed  in 


section. 


356 


J.M.  Capone  and  I.  Stavrakakis  / Performance  Evaluation  27&28  (1996)  347  365 


Dropping  rate,  source  1 

Fig.  4.  Achievable  region  V  and  call-admission  region  (polyhedral)  V*  for  a  system  with  two  sources. 


4.1.  Call  admission  control 

The  following  definitions  facilitates  the  discussion  of  this  section. 

Definition  4.1.  The  worst-tolerable  QoS-vector  d  =  {d\ ,  d2 . dN)  associated  with  the  N  applications 

is  defined  to  be  the  one  whose  coordinates  are  equal  to  the  largest  packet  dropping  rate  acceptable  by  the 


corresponding  application. 

Definition  4.2.  d1  =  (dj ,  d\ 
d'  <  d-  —  iff 

d}  <  df,  1  <i  <N. 


dlN)  is  said  to  be  better  than  d*  =  (df,  d; . dmN)  denoted  by 

(25) 


Definition  4.3.  The  QoS  of  N  applications  is  said  to  be  satisfied  if  there  exists  a  policy  inducing 
a  packet  dropping  rate  vector  d  (that  is  d  €  V)  which  is  better  than  the  worst-tolerable  QoS  vector 
associated  with  these  applications. 

Definition  4.4.  Let  &(il,  g  C  S  =  {1. 2 . N)  be  the  lower  bound  on  the  cumulative  packet  dropping 

rates  associated  with  subsets  of  N  given  application  and  T  available  resources  (  blg]  is  defined  precisely 
earlier);  let  V  be  the  associated  region  of  achievable  QoS  vectors.  The  call  admission  region  V  is 
defined  to  be  the  set  of  dropping  rate  vectors  d  =  (d\ ,  di . dN)  satisfying. 

<4,1  2  *1,1.  £  s-  C6) 

Notice  that  for  g  =  S,  (26)  is  satisfied  with  strict  equality  for  the  dropping  rate  vectors  in  V.  Thus, 
V+  is  a  polyhedral  having  V  as  one  of  its  faces  [9].  A  graphical  representation  of  the  call  acceptance 
region  V+  is  shown  in  Figs.  4  and  5  for  a  system  of  N  =  2  and  N  =  3  sources,  respectively. 
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Fig.  5.  Achievable  region  V  and  call-admission  region  (polyhedral)  V*  for  a  system  with  three  sources. 

The  following  theorem  establishes  a  call  admission  rule. 

Theorem  4.1.  Let  d  =  (dud2 . dN)  be  the  worst-tolerable  QoS  vector  associated  with  N  given 

applications.  Then  if  d  is  in  the  call-acceptance  region  T>+.  then  the  QoS  of  the  N  applications  can  be 
satisfied. 

Proof.  The  proof  is  self-evident  provided  that  there  exists  a  policy  that  induces  a  dropping  rate  vector 
d\  such  that  d'  €  V  and  d'  <  d.  The  latter  holds  true  if. 

d\t\  >  d[s]  >  bw  Vg  C  S,  (27) 

d[S\  >  ^|5)  =  ^(51  • 

which  is  established  in  [12]  by  denoting  V  to  be  the  core  of  the  game  (5.  b).  According  to  [12]  the  core 
of  game  (S,  b)  consists  of  all  vectors  d  which  satisfy  d|si  =  b{S)  and  d[g]  >  b{g]  for  all  g  C  S  where 
dp,  the  core  of  the  game  is  defined  to  be  large  if  the  conditions  in  (27)  are  satisfied  for  every 

d  6  V+.  A  sufficient  condition  for  the  core  of  the  game  to  be  large  is  for  the  set  function  b[g |  to  be  super 
modular,  see  Theorem  4  of  [12];  this  result  has  been  established  in  the  appendix,  see  Lemma  A.  1 .  □ 

4.2.  Resource  allocation 

As  implied  earlier  in  the  discussion  associated  with  Fig.  1.  applications  in  integrated  services 
networks  are  not  expected  to  be  allowed  to  access  all  available  resources.  Some  broad  groups  (or 
classes)  are  expected  to  be  defined  and  only  a  proportion  of  the  resources  will  typically  be  available  to 
each  group.  If  it  is  desirable  that  a  certain  number  of  N  applications  be  supported,  it  is  important  that 
the  minimum  amount  of  required  resources  to  be  alloted  to  that  group  be  determined.  This  problem  is 
discussed  below. 
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To  emphasize  the  dependence  of  the  quantities  P,  P+  and  b (4)  for  all  g  C  S,  on  T,  they  will  be 
written  explicitly  as  functions  of  T  and  be  written  as  V(T),  V+(T)  and  b[g)(T),  respectively. 

Theorem  4.2.  Let  d  =  (d\,d2 . dN)  be  the  worst-tolerable  QoS  vector  associated  with  N  applica¬ 

tions.  The  minimum  amount  of  resources  -  denoted  by  Tm,„  -  required  in  order  for  the  QoS  of  these 
applications  to  be  satisfied  is  given  by, 

r„„  =  ( T.  T  >  1  :  [  (4 i  >  »w<n.  V*  £  S)  and  (28) 

(4.1  <  ~  »  ^»“|,£S)]|. 


Proof.  Since  d  e  V+(T),  where  V+{T)  is  the  call  acceptance  region  determined  by  T  and  g  c  S). 
the  QoS  of  the  N  applications  will  be  satisfied  (Theorem  4.1).  To  establish  that  Tm(„  determined  by 
(28)  is  indeed  the  minimum  it  suffices  to  show  that  b[g](T)  are  non-decreasing  functions  of  T .  This  is 
obvious  since  the  lower  bound  on  the  dropping  rates  for  a  group  g,  b[g |,  cannot  increase  as  the  number 
of  service  opportunities  fixed  service  cycle  increase.  0 


4.3.  The  special  case  of  homogeneous  applications 


In  this  section  the  theory  developed  earlier  is  applied  to  a  system  supporting  identical  applications. 
That  is.  both  the  packet  arrival  process  and  the  largest  acceptable  packet  dropping  rates  are  identical  In 

this  case,  the  worst-tolerable  QoS  vector  will  have  identical  coordinates;  that  is  d  =  ( dv,du . du). 

The  following  theorems  establish  a  (simplified)  call  admission  rule  for  a  homogeneous  system. 

Theorem  4.3.  Let  d  =  (dv.  dv . du)  be  the  worst-tolerable  QoS  vectors  associated  with  the  N 

homogeneous  applications.  The  vector  d  is  in  the  call-acceptance  region  V+  (and.  thus,  the  QoS  of  all 
N  applications  is  satisfied)  if 


*  *  ^{51 

ds  >  bs  or  dv  > 

N 


(29) 


Proof.  The  definition  of  V+  (Definition  4.1)  and  the  call-admission  rule  (Theorem  4.1)  establish  that 
(29)  is  a  necessary  and  sufficient  condition  provided  that  it  implies  that, 

b{g) 


d[g)  >  b\g\ 


or  dv  > 


III 


C  5, 


where  |#|  denotes  the  number  of  elements  (sources)  in  set  g.  In  view  of  (29),  (30)  will  hold  if, 

v5cs. 

N  ~  \g\  S  ~ 


(30) 


(31) 


Since  all  the  sources  are  statistically  identical,  set  functions  b[g j  associated  with  sets  g  having  the  same 
number  of  sources  are  equal.  That  is, 

b[ai . am)  =  bU . m)>  V(a, . am ),  m  =  1,2 . N,  a,  a(,a;  €  5. 


(32) 
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Using  (32)  and  Lemma  A.l, 


26(0||  —  b[ai]  +  f*(a.)  <  ^(ai.a;) 

3^(a,.a;l  =  ^la,.a;)  +  £|a,,aj)  +  <  ^{a,.fl:.aj|  +^(ai)  +  ^(a:.oj|  <  -b[aua2.a,) 


(33) 


(l$l  +  1)*UI  <  tel^lsua)  a  £  {g). 


Equivalently, 


^Igua)  >  bM 

tel  + 1  tel 


a  <f.  te). 


and  (31)  immediately  follows  from  this  result. 


(34) 


5.  A  class  of  policies  inducing  all  QoS  vector  in  V 

Let  Cq-hoL  denote  the  class  of  ordered  HoL  priority  service  policies  7r*’s  introduced  in  Section  3.2. 


Definition  5.1.  A  mixing  ordered  HoL  priority  service  policy  fm  is  defined  to  be  one  that  at  each 

frame  decides  to  follow  the  ordered  HoL  priority  policy  7rl=  (tt j ,  Tri . n‘N)  with  probability 

a,  ,  a,  >0,  1  <  /  <  N\ ,  £jl!|  a,  =  1;  decisions  over  consecutive  frames  are  independent.  Clearly.  fm 
is  completely  determined  by  the  N\  dimensional  vector  a,  a.  >  0,  1-  a  =  1.  Let  Mo-HoL  denote  the 
class  of  such  policies. 


The  following  theorem  establishes  the  main  results  of  this  section. 

Theorem  5.1.  For  each  packet  dropping  rate  vector  d  6  “D  there  exist  a  policy  fm  €  Mo-HoL  that 
induces  d. 

Proof.  Let  d  €  V.  Then  d  =  a«d«»-/  f°r  some  Q=  . “«'•)  where  a‘  -  1  -  \  - 

If  jt  ctj  =  1,  since  any  point  in  'D  can  be  written  as  a  convex  combination  of  the  extreme  points 

(vertices)  d of  £>(=  £>“);  each  d,x,_,  is  induced  by  some  policy  in  C0-hoL  (Theorem  3.4). 

Let  fm  be  the  mixing  policy  which  selects  the  HoL  priority  7 r*  (that  induces  d,.t/_,  )  with  probability 
a,.  The  packet  dropping  rate  vector  Afm  induced  by  fm  is  given  by. 

dfm  =  £Vd«(-/  (35^ 

1  =  1 

and  thus  fm  induces  d.  ^ 

The  following  corollary  is  obvious  in  view  of  the  above  theorem. 

Corollary  5.1.  Let  d  e  V  be  a  target  packet  dropping  rate  vector.  The  mixing  policy  fm  =  a  induces  d. 
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where  a  is  such  that, 

(36) 

(37) 

(38) 

where  E  q[.]  is  weighted  average  of  the  set  of  extreme  points.  dexl,  ofV  with  respect  to  the  Probability 
Mass  Function  a. 


N'. 

E  Oc  [df.tf]  =  y  \<Xjdext-i  =  d. 
1=1 

a  >  0. 
1-  a  =  1. 


6.  A  numerical  example 

In  this  section  a  numerical  example  is  carried  out  for  a  system  with  three  sources  competing  for  T 
slots  in  a  TDMA  frame.  The  source  packet  arrival  processes  are  assumed  to  be  mutually  independent. 
Each  arrival  process  is  described  in  terms  of  an  underlying  finite-state  Markov  chain  embedded  at  the 
frame  boundaries.  The  number  of  packets  generated  (and  requesting  service)  by  a  source  in  the  current 
frame  boundary  is  (probabilistically)  determined  by  the  present  state  of  the  associated  underlying 
Markov  chain. 

Consider  in  this  example  the  typical  TDMA  frame  in  which  each  slot  represents  a  32  Kbps  channel. 

Let  Ek  =  (0,  1 . Mk  -  1}  denote  the  state  space  of  the  Markov  chain  associated  with  source  k. 

k  =  1,2,  3.  In  this  example  M'  =  2,  M2  =  3  and  M2  =  4.  When  in  state  i,  a  source  generates  21- 
packets  in  the  current  frame.  Source  1,  has  a  maximum  transmission  rate  of  64  Kbps,  it  can  model 
an  uncompressed  voice  source  [2].  Source  2  is  the  superposition  of  two  uncompressed  voice  sources. 
Source  3  could  be  used  to  model  a  low  rate  video  transmission  with  maximum  rate  6  slots/frame  =  192 
Kbps  and  step  size  of  64  Kbps.  Further  description  of  all  three  sources,  including  transition  rates,  can 
found  in  Tables  1 , 2  and  3;  af  (fif)  denote  the  transition  rates  from  state  i  to  state  i  +  1  (state  i  to  /  -  1 ). 
of  source  k:  i.  i  -  1.  /  +  1  €  £*,  k  =  1.  2, 3.  The  per  frame  arrival  rates  are  easily  computed  and  are 
equal  to  A.,  =  0.8.  k2  =  1.6  and  ky  =  4.0  packets  per  frame. 

Let  pk  denote  the  target  packet  dropping  probability,  for  source  k,  k  =  1.2.3.  In  this  example 
it  is  assumed  that  p,  =  0.02,  p2  =  0.04  and  py  =  0.02.  By  multiplying  these  probabilities  by  the 
corresponding  per  frame  arrival  rate,  the  per  frame  packet  dropping  rates  d{  =  0.016.  d2  =  0.064 
and  dy  =  0.08  are  obtained.  Thus,  the  worst-tolerable  QoS  vector  d  is  given  by  d  =  (d\,d2,  dy)  - 
(0.016. 0.064, 0.08)  and  the  associated  system  packet  dropping  rate  is  equal  to  d[S\  =  d\  +  d2  +  dy  = 
0.160.  By  applying  Theorem  4.2  it  is  established  that  the  minimum  amount  of  resources  required  in 
order  for  d  to  be  in  the  call-acceptance  region  are  Tmin  =  9  slots  per  TDMA  frame. 


Table  I 


Description  of  source  1 

State,  i  or,1 

State  probability 

0  0.667 

0.6 

1  -  1.0 

0.4 

Table  2 

Description  of  source  2 


State,  i 

a - 

fir 

State  probability 

0 

1.330 

- 

0.36 

1 

0.667 

1 

0.48 

-> 

_ 

-> 

0.16 
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Table  3 

Description  of  source  3 


State,  i 

af 

a3 

State  probability 

0 

2.7000 

- 

0.05 

1 

4.4000 

0.9 

0.15 

2 

0.6818 

1.2 

0.55 

3 

- 

1.5 

0.25 

In  this  case  d{S i  >  b[S i  =  0.1552  and  any  vector  d'  satisfying  conditions  in  Theorem  4.1  can  be 
selected.  The  (non!unique)  vector  d'  =  (0.016. 0.064. 0.0752)  was  found  by  requiring  (last  consent  in 
Theorem  4  1)  that  rf'  =  blS\  =  0.1552.  Since  d'  €  V,  there  exists  a  mixing  ordered  HoL  pnonty  pohc> 

fm  =  a  achieving  exactly  the  QoS  vector  d'.  Any  a  satisfying  the  condmons  of  Corollary  5.1  may  be 

chosen.  For  this  example  and  employing  linear  programming  techniques,  the  following  a0  was  oun  . 

a,  =  0.1280 

a.  =  0.0000 


O-o  = 


a3  =  0.2843  |  (39) 

a4  =  0.1810 

a5  =  0.2678 
[  a6  =  0.1389  _ 

As  it  is  expected  from  the  formulation  of  the  constraints,  more  than  one  solution  may  be  found. 
This  allows  for  the  incorporation  of  additional  constraints  representing  other  desirable  qualities '  of  ™ 
policies.  Functions  of  interest  may  be  minimized  subject  to  the  constraints  presented  in  this ;  paper  to 
guarantee  the  achievability  of  the  resulting  policies.  For  instance,  among  all  mixing  policies  indue, n id. 
the  one  which  minimizes  the  variance  of  the  service  provided  to  certain  sources  may  be  identified, 
additional  objectives  will  be  pursued  in  the  future. 

To  further  illustrate  the  design  concept,  consider  the  following  two  user  system.  In  this  example 
M{  =3,  M2  =  4.  As  in  the  first  example,  when  in  state  «  a  source  generates^  packets  in  the  cunen 
frame.  Source  1  and  source  2  have  probability  state  distributions.  (0. 1 . 0.3. 0.6)  and  (0.5  0.3. 0.1^.  ). 
respectively.  The  per  frame  arrival  rates  are  easily  computed  and  are  equal  to  X,  -  j.O  and  A*  -  10 

PaCInelhiselxfarpeie  the  worst-tolerable  QoS  vector  d  is  given  by  d  =  {d\  d2)  =  (0-7®1’  ^ 

the  associated  system  packet  dropping  rate  is  equal  to  d[S |  -  d\  +  d2  -  ^  "  1  Althoush 

system  packet  dropping  rate  can  be  satisfied  with  7  =  5  slots,  that  is  d[S\  -  {S|  •  •  ~ 

with  7  =  5  d,  <  b,  =  0.10.  and  thus  the  worst-tolerable  QoS  vector  is  not  achievable.  In  this  case 

the  best  possible  performance  that  source  2  can  receive  is  if  it  is  given  pnonty.  resulting  dropping 
rates  of  0.780  and  0.10  for  source  1  and  2  respectively.  The  overall  system  performance  is  satisfied,  bu 
source  2  is  experiencing  poorer  service  than  its  worst-tolerable,  while  source  1  is  expenencing  improve 
performance.  Applying  Theorem  4.2  it  is  established  that  the  minimum  amount  of  resources  required 
order  for  d  to  be  in  the  call-acceptance  region  are  Tmi„  =  6  slots  per  TDMA  frame. 
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7.  Conclusion 

In  this  work  the  region  of  achievable  QoS  vectors  for  an  integrated  services  wireless  network  was 
precisely  determined.  The  QoS  in  a  shared  un-buffered  environment  (wireless  network)  is  measured  by 
the  packet  dropping  probability,  p, .  The  determination  of  the  region  of  achievability  for  this  environment 
was  first  made  possible  by  noticing  that  a  conservation  law  applies  to  the  packet  dropping  rate.  d,,  from 
which  a  packet  dropping  probability  can  be  inferred  as  p,  =  f:-  Secondly,  the  super-modularity  of  the 
set  function  b[g),  describing  the  lower  bound  on  the  cumulative  packet  dropping  rate,  made  it  possible  to 
classify  the  extreme  points  (vertices)  of  the  region  of  achievability. 

Knowledge  of  the  region  of  achievable  QoS  vectors  led  to  the  development  of  an  admission  control 
rule  and  resource  allocation  techniques  for  the  applications  in  the  integrated  services  wireless  network. 
Admission  control  for  a  special  case  of  a  homogeneous  system  was  considered.  The  anticipated  result. 
dv  >  *111,  or  equivalently,  p,  >  jfj-,  which  had  been  seen  in  [7]  as  a  sufficient  condition  for  admission 
to  a  cellular  voice  network,  was  derived. 

Once  it  was  established  that  the  target  QoS  vector  was  in  the  achievable  region,  easy  to  implement 
scheduling  policies  were  derived  that  achieve  the  desired  result.  It  was  shown  that  there  exists  a  class  of 
scheduling  policies  -  mixed  Ordered  HoL  policies  -  that  can  achieve  any  achievable  QoS  vector. 


(40) 


Appendix  A 

Lemma  A.l.  b.\  is  a  super-modular  set  function,  that  is, 
bA+bB  <  b(AuB|  +  b[Ar\B\ 
and  equality  hold  only  if  A  C  B  dr  B  C  A. 

Proof.  Let  XA  (XB)  be  the  generic  random  variables  representing  the  number  of  packets  generated  by 
sources  in  set  A  (B)  over  a  frame.  A,  B,C  S.  Clearly, 

XA  +  XB  =  A.|^ufi|  +  ^|Mns|  VA,  B  C  S. 

Case  (1):  Assume  A  and  B  are  disjoint  (A  n  B  =  0).  In  this  case  X(Aufli  =  XA+XB  and. 

b[AvB\  —  { E  j^/A  +  XB  j  XA  +  XB  >  7"j  —  7"  j  Pr(XA  +  XB  >  T) . 

The  following  implication  is  self-evident: 

XA>  T.Xb  <T 


(41) 


(42) 


XA  +  XB  >  T 


XA  <  T,  XB  >  T 
XA  >  T,XB  >  T 
XA  <  T,XB  <  T,XA  +  XB  >  T. 


(43) 


Assuming  that  all  sources  are  independent  then  XA  and  XB  are  independent  since  sets  A  and  B  are 
disjoint,  then, 

?r{XA+XB  >  T)  =  Pr(XA  >  T)P(Xb  <  T)  +  Pt(Xa  <  T)  P  (XB  >  T) 

+Pr(XA  <  T.Xb  <  T,Xa  +XB  >  T)  +  Pt(XA  >  T)  P  (XB  >  T) , 


(44) 
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therefore  (42)  may  be  written  as, 

Wl  =  (£[A-'*  +  Xfl|^>r’XB-7]“7'|Pr(XA  >  r)Pr(XB-r) 

+  j  E  [x*  +  \B  ^  <  T,Xb  >  r]  -  ij  Pr(kA  <  T)  Pr  (kB  >  T) 

+  E  \A+xB  ^  >  t,xb  >  r]  -  r}  Pr(XA  >  T)Pr(kB  >  T) 

+  e  xA  +  xB  kA  <  t,xb  <  t,Xa  +  *«  >  r]  -  r) 

•Pr(XA  <T,kB  <  T,kA  +  kB  >  T). 


(45) 


Expanding  the  expected  value  operator  over  and  kB  and  rearranging  terms,  (45)  can  be  written  as, 

blAuB)  =  \e [a*  |  kA  >  r]  -  r)  Pr (^  >  T)  +  \e [xb  |  kB  >  r]  -  r j  Pr(xfl  >  t )  (46) 

+E  kA  kA  <T  ?r(kA  <T)?T(kB  >  T) 

+E  kB  kB<T  Pr  (kA  >  T)  P  (kB  <T)  +  TPr(kA  >  T)  P  (kB  >  T) 

+  {£[^  +  ^|xA<T.x5<r.xA  +  x.>  rj-r) 

-  Pr  (XA  <  T,kB  <  T,kA  +  kB  >  T) 

=  ^|A|  +  b{  B\  +  Cx-x« 


where, 

Ck*-k‘  =  E  [x*  |  kA  <  rj  Pr  (kA  <  T)  Pr  (kB  >  T) 

+E  [xB  |  kB  <  T]  Pr  (kA  >T)P  ( kB  <T)  +  TPt  (kA  >  T)  P  (kB  >  T) 

+  |  E  +  kB  |  kA  <  T,kB  <  T,  kA  +  kB  >  tJ  —  7"] 

•  Pr(XA  <  T.kg  <  T.kA+kB  >  T). 

Notice  that  Ck*k‘  >  0  (for  the  cases  of  interest  corresponding  to  T  >  0).  Thus,  from  (46),  and  since 
b[AC\B)  —  0  for  {A  O  £}  —  0* 

bA  +  bB  =  b[AuB)  -  CkA,k‘ =  b[A<jB)  + b[Ar\B\  -  Ck/,'k‘ ,  ^ 


bA  +  bB  <  +  b\Ar\B\- 

Case  (II):  Assume  the  A  and  B  are  not  disjoint  (A  n  B  #  0).  By  writing  {A  U  B)  as  the  union  of  the 
disjoint  sets  (A)  and  ( B  -  [A  n  £}}  and  using  the  results  from  Case  I  (46),  the  following  can  be  written. 

b{AvB\  =  6(Au(0-|-4nfl)|  =  bA  +  fe|0-|^nsn  +  C  *’ 

Similarly,  by  writing  set  B  as  the  union  of  the  disjoint  sets  {A  n  B)  and  [B  —  {A  n  B)},  the  following 
can  be  written, 

be  =  b[AnB)  +  b[B-[AnB\\  +  ^ 


(50) 
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or, 

^(S-Mnsji  =bB~  bunt,)  ~ 

(51) 

Finally,  by  combining  (49)  and  (51),  the  following  is  obtained, 

b[AuB]  +  b[AnB]  =  bA  +  bB  + 

(52) 

If  A  C  B  then, 

(53) 

Similarly  if  B  C  A.  If  A  (f.  B,  then  (Aflfi)c^  and. 

(54) 

completing  the  proof  of  the  lemma.  u 

Lemma  A.2.  The  sets  gj.  j  =  1,2 . N,  that  satisfy  Eq.  (20).  defining  the  vertices  of  the  polytope  Vu. 

must  be  strictly  included  in  each  other;  that  is 

either  hj  =  gj  —  (gj  fl  gk)  =  0  or  hk  =  gk  —  ( gk  ^gj)  —  0. 


for  all  gj  and  gk  in  (20). 

Proof.  Let  gi  and  gi  be  two  sets  satisfying  (20).  By  adding  the  corresponding  equations  the  following  is 
obtained. 


b\g>)  +  big>\  =  £</*  +  12d‘  ~  £  d‘  +  £  ^ 

i£gt  ieg:  ielgiUg;!  i€  lgirtg:} 


(56) 


or. 


(57) 


£  di  =  ^l*'l  +blSi\  -  £  di- 

ieUtUi:)  i€|*ins;| 

Since  for  any  subset  of  S  (17)  must  be  satisfied,  then  for  {^i  n^2}.  £l€(ilng2|  di  -  ^ts.r>s:l’ the  f°llowin2 
is  obtained  from  (57), 

X]  d!  -  big>\  +  bigi\  -  ^58) 

Suppose  now  that  g\  and  g2  are  not  strictly  included  in  each  other.  That  is  /i|  =  gi  —  (g\  H  #2)  and 
hi  =  gi  —  (#i  H  gi)  are  both  non-empty.  Then,  Lemma  A.l  implies  that, 

b\g>)  +  <  big^gi\  +  ^ 


or, 

b\gi)  +  b[g:)  -  b{g<r\gl\  <  bis,Ug:)- 

Therefore, 

£  di  -  fyjil  +  ^Is:)  _  ^liinS:l  <  blgtUS:) 

*€{£|U£;} 


(60) 


(61) 
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or, 

5"!  d'i *  < 

ie|*iUg:l 

which  implies  d*  i  Vu,  this  is  a  contradiction  since  d* 
strictly  included  in  each  other  follows. 


(62) 


is  a  vertex  of  Vu.  Thus  the  assumption  that  g,  are 

□ 
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Study  of  Delay  Jitter  With  and 
Without  Peak  Rate  Enforcement 


Randall  Landry,  Member ;  IEEE ,  and  Ioannis  Stavrakakis,  Senior  Member ,  IEEE 


Abstract — In  this  work,  the  modification  of  a  tagged  traffic 
stream  due  to  statistical  multiplexing  is  studied  by  presenting 
a  numerical  approach  for  the  calculation  of  the  tagged  delay 
jitter  and  interdeparture  processes.  Both  the  single-  and  multiple- 
node  cases  are  considered.  Unlike  the  past  work,  the  developed 
approach  is  applicable  under  both  the  standard  FCFS  policy 
and  a  peak-rate  enforcing  multiplexing  policy.  The  latter  policy 
can  be  adopted  to  reshape  the  tagged  traffic  stream  within  the 
network.  Restoring  a  shaped  traffic  profile  within  the  network 
may  be  necessary  to  obtain  some  of  the  benefits  for  which  the 
original  shaping  is  carried  out  at  the  network  edge.  This  study 
also  provides  results  and  insight  regarding  the  potential  gains  of 
reregulation  within  the  network. 

Index  Terms — author:  please  supply. 


I.  INTRODUCTION 

HE  asynchronous  transfer  mode  (ATM)  is  presently  con¬ 
sidered  to  be  one  of  the  most  promising  switching  and 
multiplexing  schemes  for  B-ISDN’s.  Although  it  is  possible  to 
achieve  high  levels  of  network  efficiency  (utilization)  through 
the  statistical  multiplexing  of  bursty  sources,  it  is  not  always 
easy  to  guarantee  quality  of  service  (QoS).  Since  at  a  given 
time  instant  the  cumulative  peak  rate  of  the  supported  ATM 
users  may  exceed  the  network  capacity,  serious  congestion 
may  occur,  resulting  in  QoS  degradation.  Source  traffic  shap¬ 
ing  and/or  regulation  has  been  proposed  as  a  means  of  reducing 
traffic  burstiness,  thereby  controlling  network  congestion. 

It  is  well  understood  that  statistical  multiplexing  may  in¬ 
duce  substantial  distortion  in  an  ATM  cell  stream  [1].  Thus, 
characterization  of  a  tagged  traffic  stream  within  the  network 
seems  to  be  necessary  in  order  to  accurately  evaluate  potential 
congestion,  as  well  as  determine  the  QoS  delivered  to  the 
tagged  stream  through  a  per-session  end-to-end  study.  In 
addition,  a  corrective  action  (that  is,  traffic  shaping)  may  be 
necessary  within  the  network  for  the  purpose  of  reducing 
distortion,  which  is  a  potential  cause  of  network  congestion. 

In  this  work,  the  distortion  of  a  (regulated)  source  traffic 
stream  due  to  a  series  of  multiplexing  operations  is  studied 
under  the  standard  FCFS,  as  well  as  a  “corrective”  peak-rate 
enforcing,  multiplexing  discipline.  The  tagged  traffic  stream 
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is  characterized  at  the  output  of  a  multiplexing  node  in  terms 
of  the  cell  delay,  delay  jitter,  and  interdeparture  processes. 
A  comparative  study  of  the  two  multiplexing  disciplines  is 
carried  out  based  on  these  quantities,  and  useful  insight  is 
obtained  regarding  the  resulting  distortion  of  the  tagged  traffic 
stream.  Finally,  the  node-by-node  approximate  description  of 
the  transformation  of  a  tagged  stream  under  the  considered 
multiplexing  disciplines  is  used  for  an  approximate  study  of 
the  end-to-end  QoS  delivered  to  the  tagged  traffic  stream. 

Past  work  regarding  the  characterization  of  a  tagged  traffic 
stream  due  to  a  single  FCFS  multiplexing  operation  may 
be  found  in  [2],  [3].  In  [2],  a  method  is  developed  for 
calculating  the  cell  interdeparture  time  distribution  of  a  general 
renewal  stream  under  an  FCFS  policy.  Statistics  for  the 
interdeparture  process  of  an  initially  periodic  cell  stream  in 
a  FCFS  multiplexer  are  also  computed  in  [3]  by  assuming  a 
first-order  Markovian  structure  for  the  cell  delay  process. 

The  problem  of  measuring  end-to-end  performance  in  an 
ATM  environment  is  difficult  and  largely  open.  Typically,  a 
specific  application  (session)  is  being  tagged  and  observed  as 
it  traverses  the  network.  Deterministic  bounds  on  delay  have 
been  derived  in  [4]-[7],  where  the  traffic  from  each  session 
is  assumed  to  conform  to  predefined  burstiness  constraints. 
These  approaches  are  based  on  worst  case  performance  and  the 
bounds  are  extremely  loose,  resulting  in  the  underutilization 
of  available  resources  when  used  for  call  admission  purposes 
[8].  Provable  statistical  bounds  have  been  derived  in  [9],  [10], 
which  also  appear  to  be  too  loose  to  be  of  practical  use  in  call 
admission.  An  alternative  approach  is  to  provide  approximate 
statistical  QoS  guarantees  [11]-[13]  by  modeling  the  traffic  at 
the  network  edge  (or  within  the  network  itself)  and  analyzing 
the  resulting  queueing  systems.  While  this  approach  has  the 
advantage  of  simplicity,  it  generally  suffers  from  the  traffic 
assumptions  which  must  be  made  at  the  network  premises. 
Past  work  on  approximate  end-to-end  network  performance 
evaluation  based  on  nodal  decomposition  and  FCFS  multi¬ 
plexing  may  be  found  in  [  14]— [  17].  Although  the  end-to-end 
study  presented  in  this  paper  is  based  on  “standard"  nodal 
decomposition  assumptions  as  in  [14],  [17],  the  single-node 
analysis  approach  is  different,  allowing  for  the  consideration 
of  policies  which  cannot  be  studied  by  employing  the  past 
approaches. 

In  the  next  section,  the  fundamental  block  of  this  work  is 
considered,  namely,  a  queueing  system  under  a  service  policy 
which  occasionally  departs  from  the  FCFS  policy  to  enforce 
a  peak  output  rate  (minimum  cell  interdeparture  time)  for  the 
tagged  traffic  stream  (Class- 1).  It  is  shown  (Section  II-C)  that 
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the  proposed  service  policy  allows  for  a  characterization  of  the 
tagged  cell  stream  which  is  consistent  with  the  GCRA  defined 
in  [18].  Furthermore,  when  the  peak-rate  enforcer  is  disabled 
(by  properly  setting  a  key  parameter),  the  proposed  policy 
becomes  the  standard  FCFS  policy  considered  in  [19],  [3]. 
In  Section  IV,  the  approximate  end-to-end  study  is  described 
along  with  a  discussion  of  important  assumptions.  Numerical 
results  are  presented  in  Section  IV  before  concluding  in  the 
final  section. 

II.  ANALYSIS  OF  A  SINGLE  MULTIPLEXING  STAGE 

There  are  two  main  objectives  in  this  work.  First,  the 
distortion  in  a  tagged  traffic  stream  at  a  single  node  due  to  the 
standard  FCFS  as  well  as  a  distortion-reducing  service  policy 
will  be  studied.  This  study  will  reveal  the  amount  of  distortion 
introduced  by  the  policies  considered,  as  well  as  provide  for 
a  description  of  the  tagged  traffic  stream  at  the  output  of  the 
multiplexing  stage.  The  latter  corresponds  to  the  tagged  stream 
arriving  at  the  next  multiplexing  stage,  and  thus,  the  next  stage 
can  be  studied  in  a  similar  manner  considering  the  tagged 
traffic  stream  as  shaped  by  the  previous  multiplexing  stage.  By 
reiterating  the  above  over  a  series  of  multiplexing  stages,  an 
approximate  end-to-end  study  of  a  tagged  traffic  stream  can  be 
carried  out,  revealing  the  end-to-end  impact  of  the  FCFS  and 
the  distortion-reducing  multiplexing  disciplines  on  a  tagged 
stream,  which  is  the  second  major  objective  of  this  work. 

A.  The  Distortion-Reducing  Peak  Output-Rate 
Enforcing  (PORE)  Service  Discipline 

The  PORE  service  discipline  is  associated  with  a  tagged 
traffic  stream  (Class- 1),  and  treats  the  rest  of  the  traffic  in  an 
indistinguishable  manner  (Class-2  or  background  traffic).  As 
it  becomes  clear  below,  the  PORE  service  strategy  for  Class- 1 
guarantees  that  cells  belonging  to  this  class  are  transmitted  at  a 
minimum  spacing  of  Xm\n  slots.  The  server  operates  according 
to  the  FCFS  discipline,  except  for  those  times  when  the  FCFS 
server  would  place  consecutive  Class- 1  cells  on  the  output  link 
at  a  distance  of  less  than  Xmin.  In  this  paper,  it  is  assumed 
that  under  the  FCFS  policy,  Class- 1  cells  are  served  before  the 
Class-2  cells  which  arrive  over  the  same  slot. 

Let  tk  denote  the  service  epoch  of  the  kth  Class- 1  cell, 
defined  as  the  beginning  of  the  slot  over  which  this  cell  is 
transmitted.  Let  Xk  =  f*+i  -  tk  denote  the  interdeparture 
time  of  Class- 1  cells  k  +  1  and  k.  Similarly,  tk  will  mark 
the  potential  service  epoch  of  the  kth  Class-1  cell,  which  is 
defined  to  be  the  beginning  of  the  slot  at  which  all  other  cells 
that  arrived  prior  to  the  kth  Class- 1  cell  have  been  served. 
Note  that  under  the  FCFS  discipline,  t*  =  VA;  >  1. 

The  PORE  service  discipline  can  be  described  in  terms  of 
the  following  rules. 

1)  Whenever  ik  -  fjt-i  <  Xm\n ,  the  server  delays  the 
initiation  of  service  to  the  kth  Class- 1  cell  by  exactly 
Hk  =  Xm\n  -  (ik  -  4-i)  slots,  which  will  be  referred 
to  as  the  holding  time  of  the  A:th  cell  for  the  remainder 
of  the  paper. 

2)  During  this  holding  time  of  length  //*,  the  server  will 
provide  FCFS  service  to  Class-2  cells  that  may  be 
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present  in  the  buffer.  This  implies  that  during  the  holding 
interval,  up  to  Hk  Class-2  cells  can  be  served  before  the 
A:th  Class- 1  cell  which  arrived  at  an  earlier  time. 

3)  When  the  holding  time  expires,  the  A;th  Class- 1  cell 
is  served;  thus,  tk  =  +  Hk,  thereby  ensuring  that 

Xk  —  i  “  -^min  • 

Note  that  according  to  rule  2),  the  server  abandons  the  FCFS 
discipline  whenever  a  Class- 1  cell  is  held  (Hk  >  I),  and  thus  it 
provides  improved  service  to  Class-2  compared  to  that  under 
the  FCFS  policy.  The  PORE  service  policy  reduces  to  the 
FCFS  policy  when  Xmjn  =  1,  in  which  case  Hk  =  0,  Vfc  >  1. 

Let  the  Class- 1  cell  interarrival  process  be  denoted  by 
{>U}jb>i,  where  Ak  represents  the  interarrival  time  (in  slots) 
between  Class-1  cells  k  and  k  +  1.  It  is  assumed  that  Ak  G 
[Amim  oo )  C  Z+,  where  Z+  denotes  the  set  of  positive 
integers.  Class-2  traffic  is  considered  to  be  the  background 
traffic  whose  presence  stands  to  corrupt  {>U}fc>i  through  the 
queueing  process. 

B.  Jitter  (Cell  Delay  Variation)  and  the  PORE  Policy 

An  important  measure  of  distortion  of  a  real-time  applica¬ 
tion’s  cell  stream  is  delay  jitter,  or  cell  delay  variation.  When  a 
session  experiences  a  large  amount  of  delay  jitter,  or  distortion 
to  its  traffic  profile,  extensive  buffering  may  be  required  at  the 
receiver  in  order  for  the  original  data  stream  to  be  recreated 
before  being  played  back.  Clearly,  this  implies  that  network- 
induced  delay  jitter  can  be  removed  at  the  receiver  at  the 
expense  of  larger  buffers  and  potentially  increased  delays.  In 
view  of  this,  it  seems  reasonable  to  expect  that  the  end  user 
buffering  requirements  can  potentially  be  significantly  reduced 
by  controlling  delay  jitter  within  the  network.  The  implication 
here  is  that  some  of  the  buffers  can  be  displaced  from  the 
receiving  end  to  the  network  premises  where  resources  are 
shared  by  more  than  one  user.  Finally,  besides  creating  the 
need  for  increased  buffering  at  the  receiver,  an  initially  well- 
behaved  cell  stream  can  induce  queueing  problems  and  QoS 
degradation  for  existing  sessions  as  it  becomes  distorted  and 
potentially  more  bursty  within  the  network. 

A  number  of  definitions  for  delay  jitter  have  appeared  in 
the  literature,  including  those  which  compute  the  probability 
distribution  of  cell  delays  and  measure  the  difference  between 
lower  and  upper  quantiles  [3],  [20].  In  this  paper,  a  cell-level 
definition  for  delay  jitter,  like  the  one  considered  in  [3],  will  be 
adopted  to  describe  the  amount  of  distortion  introduced  into 
the  Class- 1  cell  stream. 

Definition  1 :  Let  Dk  denote  the  total  queueing  delay  (in 
slots)  of  the  kth  Class-1  cell,  including  the  transmission  slot. 
The  delay  jitter  of  the  kth  cell  is  defined  as 

Jk  =  Dk+i  -  Dk  for  k>  L  □  (1) 

In  terms  of  the  quantities  defined  in  the  previous  subsection, 
an  equivalent  definition  of  delay  jitter  is  given  by 

Jk  =  Xk-  Ak  for  k  >  1.  (2) 

Note  that  J*  measures  the  degree  to  which  the  interarrival 
time  Ak  of  Class- 1  cells  k  and  k  +  1  has  been  distorted,  and 
it  can  take  both  positive  and  negative  values.  Negative  delay 
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jitter  applies  to  those  instances  when  Class- 1  cells  k  and  k+  1 
are  transmitted  at  a  spacing  of  less  than  A*  (cell  clustering), 
while  positive  delay  jitter  applies  to  those  times  when  cells  k 
and  k- h  1  are  transmitted  at  a  spacing  of  greater  than  A*  (cell 
spreading).  Note  that  when  the  process  {Ajt}fc>i  has  a  finite 
region  of  support  (i.e.,  A*  €  [Amjn,  Amax]),  the  PORE  policy 
completely  eliminates  a  portion  of  the  negative  delay  jitter  by 
guaranteeing  that  Jk  >  Xm[n  -  Amax. 

Since  providing  a  minimum  spacing  of  Xmin  eliminates 
some  of  the  negative  delay  jitter,  it  would  be  reasonable  to 
investigate  whether  the  PORE  policy  induces  probabilistically 
larger  positive  delay  jitter  than  the  FCFS  policy.  This  is 
addressed  in  the  following  theorem. 

Theorem  1:  Consider  any  Class- 1  arrival  stream  with  min¬ 
imum  cell  interarrival  time  Amin.  Let  JfCFS  (j£ORE)  denote 
the  delay  jitter  of  the  klh  Class- 1  cell  under  the  FCFS  (PORE) 
service  policy.  If  Xmin  <  Amint  then  for  all  A:  >  2, 

Pr  { Jr°RE  >  j}  <  Pr  {-4FCFS  >  j}  Vi  >  0 

with  equality  when  Xmin  =  1. 

Proof:  Recall  that  if  the  A:th  Class-1  cell  is  held,  it  is 
guaranteed  to  be  served  at  time  tk  such  that  Xk  =  Xm\n.  This 
implies  that  a  held  cell  will  never  be  placed  on  the  output  link 
at  a  distance  less  than  Xmin  from  the  previous  Class-l  cell. 
Since  Xm\n  is  always  less  than  Am in,  then  Xk  <  A*.  From 
(2),  this  implies  that  Jk  =  Xk  -  Ak  cannot  be  greater  than  or 
equal  to  zero,  or  positive  delay  jitter  is  never  induced  through 
this  output-rate-enforcement  action.  In  fact,  the  positive  delay 
jitter  is  decreased  since,  while  the  A:th  Class-1  cell  is  being 
held,  the  server  is  attending  to  Class-2,  and  consequently,  the 
service  time  of  the  (k  -f  l)st  cell  is  possibly  moving  closer  to 

tk •  □ 

Theorem  1  establishes  the  fact  that  regulating  Class- 1  by 
enforcing  a  peak  output  rate  does  not  increase  the  positive 
delay  jitter  of  this  traffic  class.  The  PORE  service  policy 
actually  reduces  the  two  types  of  distortion  most  commonly 
induced  by  multiplexers,  namely,  spreading  and  clustering. 

C.  The  Generic  Cell  Rate  Algorithm  (GCRA)  for  ATM 

The  negative  impact  of  delay  jitter  on  an  ATM  connection 
(as  well  as  other  supported  sessions  in  the  network)  has 
been  recognized  by  the  ATM  Forum  [18],  and  one  of  the 
consequences  has  been  the  development  of  the  GCRA,  which 
establishes  a  relationship  between  peak  cell  rate  Rp  and  cell 
delay  variation  tolerance  (CDVT).  Rp  is  defined  as  the  inverse 
of  the  minimum  cell  interarrival  time  at  the  user;  assuming  that 
Amin  is  the  minimum  interarrival  time  contracted  by  the  user, 
then  Rp  =  1/Amjn.  On  the  other  hand,  CDVT  represents  a 
bound  on  the  cell  clustering  phenomenon  previously  described, 
and  it  is  defined  in  relation  to  Rp  according  to  the  GCRA  as 
explained  below. 

The  basic  function  of  the  GCRA  is  to  update  a  theoretical 
arrival  time  (TAT),  which  is  simply  the  expected  arrival 
time  of  a  cell  assuming  that  cells  are  equispaced  when  the 
source  is  active.  Let  denote  the  actual  arrival  time  of 
the  Arth  cell  at  some  observation  point  within  the  network. 
If  tk  is  not  “too  early**  relative  to  the  TAT,  or  if  tk  > 


Fig.  1.  The  discrete*time  queueing  system. 

TAT  —  r  for  some  small  r  >  0,  then  the  cell  is  said  to  be 
conforming.  The  corresponding  cell  stream  is  said  to  conform 
to  GCRA(Amin,  t),  where  r  is  the  value  of  CDVT.  In  this 
paper,  it  will  be  assumed  that  r  takes  only  integer  values. 

A  cell  stream  conforming  to  GCRA(Am;n,  r)  may  be  con¬ 
sidered  to  have  a  contracted  peak  rate  of  1/Amjn  and  a 
“tolerated**  peak  rate  of  1/Xmin ,  where 

^min  ~  Amin  ~  T. 

Clearly,  the  PORE  service  policy  described  above  can  be 
used  to  ensure  that  the  Class- 1  cell  stream  conforms  to 
GCRA(Amin,  r)  at  the  output  of  any  multiplexer  or  switching 
site  within  the  network. 

The  appropriate  value  of  r,  which  is  to  be  chosen  by 
the  user  from  a  set  of  values  supported  by  the  network,  is 
the  topic  of  ongoing  research  [18],  but  it  is  expected  that 
under  certain  operating  conditions,  the  impact  of  CDVT  on 
network  resources  will  be  significant.  Clearly,  a  r  decreases 
the  more  well  behaved  the  cell  stream  becomes.  In  terms 
of  the  quantities  associated  with  the  PORE  policy,  when 
-Xmin  =  Am in,  t  becomes  zero  and  the  Class- 1  cell  stream 
conforms  to  GCRA(Amjn,  0),  which  happens  to  be  the  func¬ 
tion  of  the  Virtual  Shaper  defined  in  [18].  Providing  this  strict 
conformance  with  the  contracted  peak  rate  will  undoubtedly 
have  a  positive  impact  on  the  QoS  delivered  to  other  traffic 
streams  along  the  same  Virtual  Path.  As  previously  mentioned, 
the  present  study  will  focus  on  the  impact  that  CDVT  has  on 
the  conforming  (or  tagged)  cell  stream  in  terms  of  delay  jitter, 
cell  interarrival  times,  and  cell  delays. 

D.  Queueing  Analysis 

In  this  section,  an  ATM  multiplexer  operating  under  the 
PORE  service  policy  (Fig.  1)  is  studied  in  terms  of  perfor¬ 
mance  measures  associated  with  the  Class-1  cell  stream.  The 
Class-1  interarrival  process  {Ajt}jk>i  is  assumed  to  be  i.i.d. 
with  distribution  /0(rri),  Amin  <  m  <  Amax«  Note  that 
a  periodic  process  with  period  T  can  be  considered  when 
Amin  =  4max  =  T  and  fa(T)  =  1. 

Class-2  traffic  is  modeled  in  terms  of  i.i.d.  batches  of  cells 
of  arbitrary  distribution.  Let  the  random  variable  Bj — with 
probability  mass  function  (PMF)  A,  (A;),  k  >  0 — denote  the 
number  of  Class-2  cells  arriving  over  an  interval  of  length  j  (in 
slots).  Clearly,  fbj(-)  can  be  derived  as  the  j-fold  convolution 
of  fbti')'*  let  Nb  be  the  maximum  value  of  B\. 

Let  Q(n;  j)  denote  the  number  of  Class-2  cells  which 
arrived  before  time  epoch  j  and  are  still  in  the  system  at  time 
n,  n  >  j.  The  following  proposition  determines  the  evolution 
of  { Q(n\  j)}n>j  under  certain  conditions,  and  will  be  useful 
for  the  study  of  the  queueing  system. 
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lk-i  “*  U-i  tk  time  — 


Fig.  2.  Realization  for  the  derivation  of  D*  and  Hk  when  i  <  Am*x 


lk-i  -*  tk  time  — 


Fig.  3.  Realization  for  the  derivation  of  Dk  and  Hk  when  i  >  /tmax- 

Proposition  1:  The  process  {Q(j  +  m\  j)}^^  evolves  as 
the  occupancy  process  of  a  Geo/D/1  queue  with  batch  arrivals 
distributed  according  to  fbx{')i  provided  that  no  Class-1  cells 
are  served  over  the  interval  [j,  j  +  n].  In  this  case, 

Pr  {<30'  +  j)  =  t\  <30;  j)  =  *} 

_  p(m) 

”  rktt 

=  E  pkT1)pi<‘  (3) 

» 

where  pffi  (Pite)  is  the  m-step  (one-step)  transition  prob¬ 
ability  for  the  Geo/D/1  queue;  P{te  =  /*,(/  —  i  +  1)  + 

/6l(°)l{f-fi=0}*  □ 

The  queueing  system  will  be  described  in  terms  of  the 
Markov  chain  {Dk,  Hk}k>i*  embedded  at  the  sequence  of 
time  instants  {tk}k> i;  tk  marks  the  service  epoch  of  the  A:th 
Class- 1  cell  as  defined  earlier;  D*  and  Hk  denote  the  total 
delay  and  holding  time  of  the  A:th  cell,  respectively. 

The  transition  probabilities  for  the  Markov  chain 
{Djt,  Hk}k>i  will  be  derived  by  first  establishing  expressions 
for  Dk  and  Hk  given  that  (Dk-\,  Hk-\)  =  (x,  j).  where 
i  >  1  and  0  <  j  <  Xmin  -  1.  In  order  to  simplify  notation, 
the  quantity  Q(tk-i\  tjt— l  —  »  +  j),  which  appears  frequently 
in  the  following  expressions,  will  be  denoted  by  Q*.  Also,  let 
Qk  denote  the  number  of  Class-2  cells  in  queue  at  the  arrival 
epoch  (time  tk-\  —  i  +  Ak-i)  of  the  k\h  Class-1  cell.  That  is, 

Qk  =  Q(*fc-i  -  i  +  —  i  + 

When  i  <  Amax,  the  expressions  for  Dk  and  Hk  can 
be  derived  by  referring  to  Fig.  2,  where  i  <  Ak-\  for  the 
realization  shown  there.  Note  that  z  can  also  take  values  greater 
than  or  equal  to  Ak-\  while  still  being  less  than  Am&x.  and 
these  cases  are  also  accounted  for  in  the  expressions  given 
below  in  (4)  and  (5),  when  z  >  Amax  (Fig.  3),  shown  in  (6) 
and  (7). 

Let  SL  =  {(t,  j)  :  L(Am „  -  1)  +  1  <  *  <  (L  + 
-  1).  0  <  j  <  Xmin  -  1}  represent  “level”  L  of 
the  Markov  chain  {D*t  fc>i*  Given  the  relationships  in 
(4H7),  the  transition  probabilities  for  {D*,  Hk}k>u  which 
will  be  denoted  by  p(z,  j,  i\  j'),  can  be  computed  in  terms  of 
the  quantities  previously  defined.  The  quantities  p(z,  j,  z',  ?), 
for  (z,  j)  G  S'0  (transitions  from  level  0)  are  given  by  (8) 


and  (9),  which  are  derived  from  (4)  and  (5)  above,  while  the 
transition  probabilities  for  (z,  j)  €  Sl  are  given  by  (10)  and 
(11),  derived  from  (6)  and  (7)  above. 

For  (i,  j)  e  5°, 

P(i,  J,  °)  =  E  E 

n=Atnin  m= 0 

(N‘-t)i  N‘(i-i+ 1) 

+  E  E  E 

n=Aa  1=0  m=0 

(8) 

where  A,  -  min{t,  i4m«x},  A2  =  max{i4min,  i  +  1}  and 
Ni  =  min  {(n  -  j)Nb,  i'  -  1  —  *  +  n}. 

p(h  j>  *  .  j  )  —  E  E  /q(n)^n— 

n=Ami„  m= 0 

Am„ 

+  E  E  E  Mn)Ptf 

n=A?  1=0  m  =  0 

•  ^  -  j + 1  ( m )  t  pfn+1 7'  -  j ' + pLn;tol)] 

•  (9) 

where  N2  -  min  {(n  -  j)Nk,  Xmtn  -  1  -  /}. 

For  (i,  j)  €  S1, 

P(* .  J.  *'»  °)  =  E  E 

n=Amin  m=  0 

*  tn-j(m)Po^-.i+n-m-l  (10) 


p(*.  j,  i')  =  E  E  /«(n)fcn-i(m) 

n=Amin  m=0 

By  lexicographically  mapping  the  quantities  p(z,  j,  i\  jf) 
into  p(ty  t)  through  the  relations  /  =  (z  -  l)Xmin  4-  j  and 
t!  =  (z;  -  l)Xmin  +j\  the  transition  probability  matrix  P  of 
the  Markov  chain  {Dfc,  Hk}k>\  can  be  written  as  a  stochastic 
matrix  of  the  M/G/l  type: 


[Bo 

B! 

b2 

B3  •••! 

Ao 

Aj 

a2 

A3  ••• 

0 

Ao 

Ax 

a2 

0 

0 

Ao 

Ax  ••• 

where  the  matrices  Bn  and  An,  n  >  0,  have  dimensions 
(Amajc  -  l)Xmin  x  (AmJVX  -  l)J\rmin.  Note  that  the  matrices  Bn 
consist  of  the  transition  probabilities  for  which  (z,  j)  6  5°  and 
(z',  f)  G  5n,  while  the  matrices  An  consist  of  the  transition 
probabilities  for  which  (z,  j)  G  S1  and  (z',  f)  G  Sn.  The 
system  in  (12)  can  now  be  solved  as  outlined  in  the  Appendix 
based  on  the  matrix  analytic  techniques  presented  in  [21].  The 
stationary  probability  vector  r  for  P  is  mapped  back  into  the 
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space  (i,  j,  i\  f)  to  obtain  the  stationary  probabilities  for 
{At,  Hk } Jt> i »  given  by  7r(i,  j)t  for  1  <  i  <  oo,  0  <  j  < 
Xmin 

The  PMF  for  cell  delay  D  of  Class- 1  is  easy  to  calculate  in 
terms  of  7r(i,  j),  and  it  is  given  by 

^min  "  1 

Pr  {£>  =  »}=  ^  tt(z,  j).  (13) 

3=0 

By  considering  the  definition  in  (l),  the  delay  jitter  distri¬ 
bution  is  easily  determined  to  be 

00  «Xmin”l  "I 

pr  {j  =  k}  =  ^r  ^2  ^2  p(*.  j.  * + *.  j'm*.  j)- 

i= 1  i=n  i'==n 


Computing  the  PMF  of  the  Class- 1  cell  interdeparture  time 
X  is  not  as  straightforward  as  computing  that  of  J.  From 
(1)  and  (2),  the  interdeparture  time  of  cells  fc  +  1  and  k 
is  given  as  Xk  =  Dk+\  -  Dk  +  Ak.  Since  the  Markov 
chain  {Dk,  Hk}k>\  does  not  provide  adequate  information 
to  describe  the  interdeparture  process,  the  system  will  be 
described  by  the  Markov  chain  { Dk ,  Hk ,  /U}*>i  in  order  to 
compute  the  PMF  of  X.  Let  p(t,  jf,  m,  f}  m')  denote  the 
transition  probabilities,  and  7r(i,  j}  m)  denote  the  stationary 
probabilities  for  this  Markov  chain.  It  is  easy  to  show  that  the 
PMF  for  the  cell  interdeparture  process  is  given  by 

00  X  min  —  1  A  X  min  —  1  Am*s 

Pr{x  =  n>  =  £  E  E  E  E 

1  —  t  j — 0  m  —  Amin  j1  —  0  rn>  —  Am,n 

•  P(»,  j,m,n  +  i-  m,  j',  m')n(i,  j,  m) 

(15) 

where  A!  —  min  {n  +  t  -  1,  Amax}. 

The  quantities  7 r(t,  j,  m)  can  be  computed  in  the  same 
manner  as  the  quantities  7r(i,  j),  with  the  exception  that  the 


dimensionality  of  each  level  in  P  has  increased  by  a  factor 
of  Amix.  Note,  however,  that  upon  arrival  of  the  kth  Class- 
1  cell,  Dk  is  determined.  That  is,  Dk  is  equal  to  the  queue 
content  at  the  arrival  instant  plus  the  amount  of  holding  time 
Hk  to  be  suffered  by  the  cell.  The  latter  quantity  is  completely 
determined  by  the  past  (i.e.,  the  service  time  of  the  (Jfc  -  l)st 
cell),  and  neither  Dk  nor  Hk  is  dependent  upon  the  next  Class- 
1  arrival.  Consequently,  the  random  variables  Dk  and  Hk  are 
both  independent  of  the  random  variable  Ak ,  and  the  stationary 
probabilities  for  the  Markov  chain  {Dk}  Hky  j4jk}jk>i  can  be 
computed  as  7r(i,  j,  m)  =  7r(i,  j)fa{m),  where  7r(i,  j)  and 
/a(m)  are  as  given  above. 


III.  Approximate  End-to-End  study 

The  N-node  tandem  queueing  system  used  for  the  approxi¬ 
mate  end-to-end  performance  evaluation  of  the  tagged  session 
is  shown  in  Fig.  4.  The  PORE  service  policy  will  be  adopted 
at  each  node  so  as  to  allow  for  the  consideration  of  peak-rate 
enforcement  within  the  network.  In  order  to  simplify  notation, 
the  Class- 1  cell  interarrival  (interdeparture)  process  at  node 
i  will  be  denoted  as  Ax  (X*).  The  arrival  process  at  node  i 
(l  <  i  <  N)  consists  of  the  tagged  (Class- 1)  interdeparture 
process  from  node  i  —  1,  and  some  background  traffic  (Class-2) 
delivering  batches  of  cells  distributed  according  to  the  PMF 
/t^*)*  X°  is  defined  to  be  the  tagged  cell  intergeneration 
distance  at  the  source;  A1  may  be  slightly  different  from  X° 
due  to  potential  truncation  (as  discussed  below). 

The  set  of  possible  integer  values  which  can  be  taken  by  the 
Class- 1  interarrival  processes  at  nodes  1  -TV  must  be  truncated 
in  order  to  facilitate  a  numerical  solution  at  each  node.  The 
minimum  (maximum)  Class- 1  interarrival  time  at  node  i  will 
be  denoted  by  Almin  (4*max).  That  is,  A{  €  [/t*min,  A^}  for 
:  =  1,  2,  •  •  • ,  /V.  The  minimum  interarrival  time  for  i  >  1  is 
simply  =  X^.  The  choice  of  A max  for  »  >  1  will  be 


and 


(  1 


Dk  =  l 


Qk 

Qk  +  Hk 

B*k-i -j  +  Q*  +  1  +  i  —  j4*_i 
BAk-t-j  +  <?*  +  !  +  *-  +  Hk 


if  =  0  and  Ak~,  >  ATmjn  +  t  —  1  and  t  <  Ak- 1 
if  Qk  >  Xmxn  +  i  -  Afc_i  and  i  <  Ak 
if  Qk  <  Xmin  +  i  -  Ak-i  and  i  <  Ak-i 
if  Q’  >  Xmin  -  Ba,..,-,  -  1  and  i  >  Ak., 
if  Q  ^  -^min  —  Ba^^i—j  1  ^nd  i  >  Ak—i 


(  0 


Hk  =  { 


0 

^"min  "b  *  Ak— i  Qk 


P 

v  X min  Bai,—i—3  Q 


if  Qk  =  0  and  Ak- 1  >  Xmin  +  i  -  1  and  »  <  Ak-\ 
if  Qk  >  Xm-,n  +  i  -  Ajt_i  and  t  <  A*-i 
if  Qk  ^  ^min  “b  i  and  t  <  Ak— i 

if  Q‘  >  Xm\n  -  BAt.t-j  -  1  and  t  > 

1  if  Q’  <  Xmin  -  BAi-x-j  -  1  and  t  > 


(4) 


(5) 


n  _  /  BAk_t-j  +  Q‘  +  1  +  t  -  Ak.i  if  (?*  >  Xmin  -  BA,.,-j  -  1  and  -ci  >  Ak-, 

Vk~\  Bau.,-3  +  Q*  +  1  +  i  ~  Ak- 1  +  Hk  if  Q*  <  Xmin  -  BAk_x-j  -  1  and  «  > 

„  _/  0  if  Q‘  >  Xmin  -  BAk.l-j  -  1  and  i  >  Ak-X 

k  ~  \  Xmin  -  BAk.x-j  -  Q*  -  1  if  Q*  <  *mi„  -  Ba^-j  -  1  and  i  >  Ak-i. 


(7) 
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A*  =  Xl  AN=XN-l 


Fig.  4.  Tandem  queueing  network  for  end-to-end  performance  evaluation. 


made  according  to  the  rule 

=  max{fc:  Pr{A''~1  =  fc}  >  f}  (16) 

where  e  can  be  any  positive  real  number  less  than  1.  The  PMF 
for  the  process  A\  i  >  1,  is  then  given  by 

pT{A'  =  k}  = 

rPr  [X{~1  =k}  ifO<k<A\nAX 

^4X-i 

<  1-  £  ?r{Xi-'=j}  if*  =  >lLx  (17) 

i=i 

,  0  otherwise. 

It  is  clear  from  (16)  and  (17)  that  as  e  approaches  zero,  A'matX 
increases  and  the  truncated  Class- 1  interarrival  time  PMF  at  a 
node  approaches  the  actual  Class- 1  interdeparture  time  PMF 
from  the  previous  node.  The  cost  of  improving  the  accuracy 
of  the  adopted  Class-1  interarrival  process  by  increasing  Almax 
is  a  corresponding  increase  in  numerical  complexity.  Recall 
that  the  dimensionality  of  the  square  matrices  Bn  and  An% 
n  >  0,  which  make  up  the  transition  probability  matrix  P  for 
a  single  queue,  increases  linearly  with  the  maximum  Class- 1 
interarrival  time  (see  Section  II-D),  which  in  turn  affects  the 
tractability  of  the  solution  technique  outlined  in  the  Appendix. 

A  number  of  other  assumptions  will  be  made  in  order  to 
facilitate  the  proposed  approximate  end-to-end  study.  First,  it 
will  be  assumed  that  only  the  tagged  traffic  stream  is  forwarded 
from  one  multiplexer  to  the  next  as  shown  in  Fig.  4.  All  of  the 
remaining  traffic  (background)  at  each  multiplexer  is  routed 
elsewhere.  Implicit  in  this  assumption  is  the  presence  of  a 
large-scale  networking  structure,  in  which  there  exist  a  large 
number  of  independent  sessions  and  a  large  number  of  total 
nodes  (only  N  of  which  serve  the  tagged  session).  In  such 
a  network,  it  is  expected  to  be  reasonable  to  assume  that, 
at  the  output  of  each  node,  most  of  the  untagged  sessions 
(background  traffic)  are  split,  or  routed  along  paths  which  are 
different  from  that  of  the  tagged  session. 

It  is  also  assumed  that  the  interdeparture  process  {ATfc}*>i 
from  each  node  is  an  i.i.d.  process.  This  assumption  is  neces¬ 
sary  for  employment  of  the  analysis  of  Section  II-B  at  each 
multiplexing  stage.  The  validity  of  the  i.i.d.  assumption,  as 
well  as  the  effect  of  truncating  the  interarrival  time  PMF  at 
each  node,  will  be  examined  through  a  comparative  simulation 
study. 

The  analysis  presented  in  the  previous  section  for  a  single 
queue  will  be  applied  successively  to  each  node  along  the  path 
in  Fig.  4,  and  the  PMF  for  the  interdeparture  process  at  node 
N  will  be  used  to  establish  the  amount  of  delay  jitter  induced 
in  the  cell  stream  of  the  tagged  session  over  all  N  hops. 


TABLE  I 

Squared  Coefficient  of  Variation  for  Cell  Inter- Departure  Time  (C2) 


load 

It 

cn 

o 

II 

Ft 

II 

K5 

O 

0.70 

0.0236 

0.0086 

0.0028 

0.80 

0.0489 

0.0209 

0.0079 

0.95 

0.1160 

0.0607 

0.0296 

IV.  NUMERICAL  RESULTS  AND  DISCUSSION 

In  this  section,  some  numerical  results  arc  presented  to 
characterize  the  distortion  induced  in  a  tagged  traffic  stream 
under  both  FCFS  and  peak-rate  enforcing  service  strategies. 
Before  considering  end-to-end  performance,  the  delay  jitter 
induced  at  a  single  ATM  multiplexer  will  be  studied. 

In  order  to  provide  for  a  simple  interpretation  of  the 
multiplexer-induced  distortion,  the  Class-1  arrival  process  is 
assumed  to  be  periodic  with  period  T  slots  in  the  results 
presented  in  this  section.  In  addition,  a  typical  regulated 
traffic  stream  (whose  reshaping  may  be  attempted  within  the 
network)  can  be  considered  to  resemble  a  periodic  one  for 
some  time  horizon  which — in  some  cases — may  be  sufficient 
to  induce  “local”  stationary  statistics.  It  should  be  emphasized, 
though,  that  the  study  presented  in  this  paper  is  applicable 
to  traffic  streams  described  in  terms  of  general  interarrival 
processes.  The  Class-2  batch  arrival  process  is  assumed  to 
be  binomially  distributed  with  mean  rate  A6  and  maximum 
batch  size  Nb  =  15.  Table  I  displays  the  squared  coefficient 
of  variation  for  the  Class- 1  cell  interdeparture  time  process 
when  an  FCFS  policy  is  adopted.  The  squared  coefficient  of 
variation  will  be  denoted  by  and  given  as 

2  _  var(AT) 
x  E2{X) ' 

Note  that  Cl  is  equal  to  zero  if  no  distortion  is  introduced  into 
the  periodic  arrival  stream.  From  Table  I,  it  is  clear  that  delay 
jitter  is  the  greatest,  under  the  FCFS  policy,  for  small  periods 
T  and  high  offered  loads. 

The  multiplexer-induced  distortion  can  also  be  quantified  by 
observing  the  PMF  for  J  as  in  Fig.  5.  In  this  case,  the  delay 
jitter  PMF  is  plotted  for  T  =  20  and  the  offered  loads  given 
in  Table  I.  Notice  that  when  the  total  load  is  0.7,  much  of  the 
periodicity  is  retained  in  the  Class- 1  arrival  stream,  which  is 
illustrated  by  the  fact  that  most  of  the  probability  mass  exists 
at  J  =  0.  However,  as  the  load  increases  to  0.95,  much  of 
this  mass  is  redistributed  and  the  Class-1  arrival  process  no 
longer  resembles  a  periodic  source.  It  is  also  interesting  to 
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Fig.  6.  Jitter  PMF  for  T  =  10  and  various  values  of  A'min  at  a  single 
multiplexer. 


note  that  the  delay  jitter  PMF  appears  to  be  symmetric  around 
zero  when  the  service  policy  at  the  multiplexer  is  FCFS;  this 
symmetry  disappears  when  a  peak  output  rate  is  enforced. 

In  Fig.  6,  the  delay  jitter  PMF  is  considered  for  different 
values  of  Xmm ,  T  =  10  and  a  total  load  of  0.9  (A6  =  0.8). 
When  Xmin  is  set  to  T  —  1  (in  this  case  9),  the  tagged  cell 
stream  conforms  to  GCRA(10,  1)  (see  Section  II-C),  and 
the  delay  jitter  process  possesses  a  very  narrow  region  of 
support.  On  the  other  hand,  under  an  FCFS  policy,  a  significant 
amount  of  cell  clustering  is  observed  since  the  cell  stream  now 
“conforms”  to  GCRA(10,  9);  cell  spreading  is  also  slightly 
increased. 

The  cost  of  providing  a  less  distorted  traffic  stream  at  the 
multiplexer  output  is  illustrated  in  Fig.  7  in  terms  of  increased 
cell  delays.  As  discussed  in  Section  II-B,  however,  the  effect  of 
reducing  distortion  at  the  network  premises,  thereby  increasing 
queueing  delays  (and  buffer  requirements),  is  essentially  to 
distribute  the  buffers  more  uniformly  throughout  the  network. 

The  remainder  of  this  section  will  deal  with  the  approximate 


Fig.  7.  Delay  tail  distributions  for  T  =  10  and  various  values  of  Xmin  at 
a  single  multiplexer. 


end-to-end  performance  study.  A  network  of  five  nodes  in 
tandem  will  be  considered,  where  the  tagged  traffic  stream 
originates  as  a  periodic  source  at  node  1  with  period  T  =  5. 
The  background  traffic  at  each  node  is  assumed  to  consist  of 
ten  identical  Bernoulli  streams  each  with  rate  0.07,  resulting  in 
a  total  offered  load  of  0.9  at  each  multiplexer.  The  relatively 
high  load  at  each  node,  together  with  the  small  period  of 
T  =  5,  is  adopted  to  provide  a  scenario  in  which  significant 
delay  jitter  is  induced  into  the  tagged  cell  stream  (see  Table 
I).  The  Class-1  interarrival  time  PMF’s  at  nodes  2-5  will 
be  truncated  by  setting  e  equal  to  10”6  and  using  the  rule 
presented  in  the  previous  section.  This  choice  of  e  resulted  in 
values  of  Alm}iXf  2  <  i  <  5,  ranging  from  20-30. 

For  the  results  in  Fig.  8,  each  of  the  five  ATM  multiplexers 
serves  arriving  cells  according  to  the  FCFS  policy  ( Xmm  =  1). 
The  PMF’s  for  interdeparture  processes  at  nodes  1  and  5 
are  computed  and  compared  with  simulation  results  for  the 
identical  tandem  queueing  network.  At  the  output  of  node 
1,  the  cell  interdeparture  process  X1  has  experienced  both 
clustering  and  spreading,  but  over  35%  of  this  traffic  remains 
periodic  with  T  =  5.  As  the  tagged  stream  passes  through 
additional  FCFS  multiplexing  stages,  the  interdeparture  times 
become  more  and  more  uniformly  distributed,  and  at  the  output 
of  node  5,  there  are  no  dominant  terms  in  the  PMF.  This 
implies  that,  in  a  real-time  scenario,  significant  buffering  may 
be  required  at  the  end  user  in  order  to  achieve  an  acceptable 
playback  quality.  Also,  a  traffic  profile  such  as  that  at  the 
output  of  node  5  will  likely  induce  more  severe  queueing 
problems  (and  possibly  QoS  degradation)  for  other  users 
sharing  the  same  network  resources. 

The  validity  of  an  i.i.d.  assumption  for  cell  interdepartures 
(and  interarrivals)  is  addressed  through  the  simulation  results 
plotted  in  Fig.  8  as  discrete  points.  Clearly,  under  an  FCFS 
policy  at  each  node,  this  assumption  does  not  significantly 
affect  the  accuracy  of  the  end-to-end  study,  as  the  numerical 
results  seem  to  be  in  good  agreement  with  simulations.  Similar 
agreement  is  present  in  Figs.  9  and  10,  where  the  traffic  has 
been  shaped  by  the  peak-rate  enforcing  policy  at  each  node. 

In  Fig.  9,  each  multiplexer  enforces  a  minimum  interde- 
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Fig.  8.  Cell  interdeparture  time  PMF’s  at  nodes  1  and  5  under  FCFS  policies 
(A'min  =  1)  at  each  node;  T  —  5. 


Fig.  9.  Cell  interdeparture  time  PMF’s  at  nodes  1  and  5  for  A'm;n  =  4  at 
each  node;  T  =  5. 


parture  time  of  Xmjn  =  4,  and  the  resulting  PMF’s  for 
X1  and  Xs  are  presented.  In  both  cases,  the  tagged  stream 
conforms  to  GCRA(5,  1)  throughout  the  network.  That  is, 
all  cell  clustering  (or  negative  delay  jitter)  less  than  -1  is 
eliminated,  and  the  PMF  at  node  5  has  a  much  more  narrow 
region  of  support  than  the  corresponding  PMF  under  the  FCFS 
policy.  This  can  be  observed  in  Fig.  10,  where  three  networks 
of  servers  with  Xmxn  equal  to  1,3,  and  4  are  considered.  As 
Xmin  increases  (and  the  CDVT  parameter  r  decreases),  the 
end-to-end  performance  of  the  tagged  stream,  in  terms  of  the 
induced  distortion,  improves  dramatically.  It  is  also  interesting 
to  note  that  the  approximate  analysis  agrees  very  well  with 
simulation  results,  which  validates  the  i.i.d.  assumption  for 
cell  interdepartures  even  when  the  corrective  peak  output-rate 
enforcing  queueing  discipline  is  adopted. 

The  squared  coefficient  of  variation  for  the  Class- 1  inter- 
arrival  process  Ax  is  plotted  in  Fig.  11  as  a  function  of  the 
node  index  i  for  the  three  networks  of  servers  considered  in 
Fig.  10.  Note  that  because  the  arrival  process  to  node  1  is 


Fig.  10.  Cell  interdeparture  time  PMF’s  at  node  5  for  different  A'mjn 


Fig.  II.  Squared  coefficient  of  variation  for  the  cell  interamval  process  to 
each  node. 

periodic,  C\  =  0.0  for  node  1  in  each  case.  As  expected,  C\ 
increases  with  the  node  index  for  all  values  of  Xmin.  However, 
depending  upon  the  peak  rale  being  enforced  by  each  of  the 
servers,  the  rate  of  increase,  as  well  as  the  magnitude,  of 
varies  greatly.  Under  an  FCFS  policy  at  each  server,  the 
tagged  cell  stream  is  changing  significantly  as  it  enters  node 
6,  and  the  induced  delay  jitter  is  quite  large.  A  noticeable 
improvement  in  the  variability  of  the  tagged  cell  stream  is 
observed  when  Xmtn  increases  or  the  CDVT  parameter  r 
decreases.  In  fact,  when  Xmxn  =  4  at  each  of  the  PORE 
servers,  ca  is  always  smaller  than  0.1.  The  cumulative  mean 
queueing  delay  for  the  tagged  stream  is  plotted  as  a  function 
of  the  node  index  in  Fig.  12.  The  total  mean  delay  appears  to 
increase  linearly  with  the  node  index,  and  the  rate  of  increase 
is  clearly  dependent  upon  the  value  of  Xmin.  These  results 
indicate  that  for  real-time  applications,  the  tradeoff  between 
violating  deadline  constraints  and  violating  either  delay  jitter 
constraints  or  a  declared  peak  rate  needs  to  be  considered 
carefully. 

Note  that  while  delay  distributions  for  the  tagged  stream  are 
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computed  at  each  multiplexing  stage,  it  is  not  yet  clear  how 
the  end-to-end  delay  distribution  can  be  computed  numerically. 
By  assuming  that  the  cell  delay  process  is  independent  from 
one  node  to  the  next,  a  simple  convolution  of  individual 
delay  PMF’s  would  yield  the  approximate  end-to-end  de¬ 
lay  distribution.  Note  that  this  assumption  is  not  necessary 
for  the  computation  of  first  moments  (Fig.  12),  where  the 
mean  end-to-end  delay  is  simply  the  sum  of  individual  mean 
delays.  Although  results  are  not  presented  here,  the  above 
approximation  has  been  examined  by  comparing  numerical 
and  simulation  results.  It  seems  that  under  a  network  of  FCFS 
servers,  the  assumption  of  intemodal  independence  for  the 
delay  process  yields  quite  accurate  results.  However,  when 
peak-rate  enforcement  is  employed  at  each  multiplexing  stage, 
the  intemodal  independence  assumption  breaks  down  and  the 
convolution  of  delay  PMF’s  proves  to  be  a  poor  approximation 
for  the  end-to-end  delay  distribution. 

Finally,  it  should  be  noted  that  the  main  contribution  of  this 
work  is  to  study  the  impact  of  reregulating  on  a  traffic  stream, 
analyze  the  PORE  policy,  compare  this  policy’s  impact  to  that 
under  the  FCFS  policy,  and  present  results  for  a  perfectly 
regulated  traffic  (periodic  traffic)  to  make  the  impact  of 
reregulation  easier  to  interpret.  This  work  is  directly  applicable 
to  the  case  in  which  the  peak  rate  of  one  traffic  stream  is  being 
enforced;  this  would  be  the  case  when  one  jitter-sensitive 
traffic  stream  is  multiplexed  with  jitter-insensitive  traffic.  If 
a  small  number  of  jitter-sensitive  traffic  streams  are  present 
(or  simply,  peak  rate  is  to  be  enforced  in  a  small  number 
of  traffic  streams),  some  scheduling  conflicts  may  arise.  Due 
to  these  conflicts,  a  greater  than  desirable  spacing  may  be 
imposed  sometimes,  potentially  adding  to  the  positive  jitter 
(more  spreading),  without  affecting  the  enforced  peak  rate 
(minimum  spacing).  This  work  will  still  be  useful  in  such 
an  environment.  When  a  large  number  of  tagged  streams  are 
present,  conflicts  may  dominate,  and  although  a  peak  rate  can 
still  be  enforced,  the  positive  jitter  and  the  delay  may  be 
increased  significantly.  In  this  case,  the  relative  shifting  of 
the  periods  (if  the  streams  are  periodic)  is  expected  to  have  a 
significant  impact  [22]. 


v.  conclusions 

In  this  paper,  a  queueing  system  was  formulated  to  study  the 
distortion  induced  in  a  tagged  cell  traffic  stream;  both  FCFS 
and  peak  output  rate  enforcing  (PORE)  service  disciplines 
were  considered.  In  fact,  the  FCFS  policy  was  shown  to  be 
a  special  case  of  the  PORE  discipline.  At  the  output  of  the 
delay  jitter-reducing  service  strategy,  the  tagged  cell  stream  is 
easily  characterized  in  accordance  with  the  GCRA  specified  in 
[18].  A  discrete-time  analysis  in  the  M/G/l  paradigm  yielded 
numerical  results  for  cell  delay,  delay  jitter,  and  interdeparture 
time  probability  distributions  at  a  multiplexer  served  according 
to  the  PORE  policy. 

An  approximate  end-to-end  performance  study  of  the  tagged 
cell  stream  was  also  carried  out  by  analyzing  a  tandem 
queueing  network  of  PORE  servers.  Numerical  results  indicate 
that  a  peak  rate  enforced  at  the  network  edge  can  be  severely 
violated  within  the  network,  and  thus,  traffic  reregulation  may 
be  necessary.  As  intended,  the  traffic  profile  of  a  cell  stream 
conforming  to  GCRA(T,  r)  was  shown  to  be  much  more 
regular,  or  less  variable,  when  the  CDVT  parameter  r  is  small 
(close  to  zero).  The  tradeoff  for  enforcing  small  r  was  also 
clearly  illustrated  through  numerical  results  in  the  form  of 
increased  cell  delays. 

Simulation  results  for  the  end-to-end  tandem  queueing  study 
indicate  that  the  assumption  of  an  i.i.d.  interarrival  process, 
with  a  PMF  computed  through  the  numerical  approach,  does 
not  significantly  affect  the  accuracy  of  the  solution.  The 
good  agreement  between  simulation  and  numerical  results 
was  observed  for  a  tandem  network  of  FCFS  servers  as  well 
as  PORE  servers.  It  is  not  yet  clear  how  this  numerical 
accuracy  will  be  affected  when  a  network  of  queues  with 
cross-corTelated  traffic  and  different  routing  realizations  is 
simulated. 

VI.  APPENDIX 

SOLUTION  OF  THE  STATIONARY 
PROBABILITY  VECTOR  FOR  P 

This  Appendix  outlines  the  steps  involved  in  computing 
the  stationary  probability  vector  for  P,  given  by  tc  = 
[*0  *T  *2  ***]•  The  vectors  i  >  0,  are  the  stationary 
probability  vectors  associated  with  level  i  as  defined  in  Section 
II-D.  The  boundary  probability  vector  *o  is  evaluated  first 
based  on  the  matrix  analytic  techniques  of  Neuts  [21].  The 
vectors  iCi ,  i  >  1,  are  then  recursively  computed  from  *0* 

A.  Computation  of* 0 

A  brief  outline  of  the  steps  involved  in  computing  7To  is 
presented  here;  a  detailed  description  of  the  method  can  be 
found  in  [21].  To  apply  the  method  the  following  matrices 
need  to  be  defined: 

00 

A  =  ^  An  (18) 

n=0 

00 

G(z)=zjAnG"(z),  \z\  <  1  (19) 

n=0 
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K(*)=*EBnG"(z),  |*|  <1.  (20) 

n=0 

The  computation  of  matrix  G  =  G(z)|*-i,  which  satisfies 
the  nonlinear  matrix  equation 

oo 

G  =  ^AnGn,  (21) 

n=0 

is  a  key  step  in  the  application  of  the  technique.  Let  g  denote 
the  stationary  probability  vector  associated  with  G.  That  is,  g 
is  the  solution  to  gG  =  g  and  ge  =  1,  where  e  denotes  the 
unit  column  vector. 

Matrix  G  can  be  obtained  as  the  limit  of  the  monotonically 
increasing,  convergent  matrix  sequence  { Gjt } jt>o>  defined  by 


B.  Computation  of  i  >  1 

The  vectors  for  i  >  1,  can  now  be  computed  using  the 
recursive  scheme  attributed  to  Ramaswami  [23], 

.(I -AD"1,  i>  1  (27) 

where 

b;  =  E  B*Gfc-, 

Jt=n 

oo 

A;  =  E  A*Gfc-n,  n  >  0. 

k  —  n 


Go  =0, 

Gjt+1  =  E  AnG*  fc>0.  (22) 

n=0 

Matrix  G  is  then  used  to  determine  matrix  K,  defined  by 
K  =  K(l),  and  given  by 

K  =  E  BnG"  (23) 


with  stationary  probability  vector  it. 

Let  ac*  be  the  vector  defined  as 

.  dK(z) 
k  =  — —  e. 

dZ  Z  =  1 

The  expression  for  ac*  is  obtained  by  differentiating  (20)  at 
2  =  1: 

.  <«(*)!  „ 


=  E  B„G"e  +  ^  B„ 


^  dG(z) 

•-EG  ~dT 


Gn_1“  e 


oo  n  — 1 


=  Ke  +  E  Bn  E 


1=1  fc=0 


where  fi  =  (<iG(z)/cfz]|x=i  e  and  is  given  by 

/i=  (I-G  +  eg)[I- A  +  (e-/9)g]-1e.  (25) 

The  vector  /?  is  given  by 

f)  =  ^  nAne.  (26) 


Finally,  the  stationary  probability  vector  x0  can  be  ex¬ 
pressed  in  terms  of  the  vectors  k  and  nm  as 


K 

KKm 
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Abstract 

In  this  paper,  the  effects  of  various  time  scales  on  the  management  of  ABR  (Available  Bit 
Rate)  traffic  using  feedback  based  control  is  studied.  Since  delay  tolerable,  the  ABR  applications 
can  be  allocated  the  remaining  resources  after  CBR  (Constant  Bit  Rate)  and  VBR  (Variable  Bit 
Rate)  applications  have  been  accommodated.  To  avoid  excessive  losses  the  transmission  rate  of 
the  ABR  applications  should  be  modulated  by  the  amount  of  remaining  resources.  That  is,  the 
ABR  rate  should  be  controlled  through  a  feedback  based  rate  control  mechanism.  In  this  paper  a 
network  link  shared  by  remote  ABR  and  VBR  applications  is  considered  and  the  impact  of  various 
system  time  scales  on  the  effectiveness  of  the  feedback  based  rate  control  scheme  is  investigated, 
by  formulating  and  studying  a  simple  analytical  model.  These  time  scales  are  expressed  in  terms 
of  the  network  transmission  speed,  the  minimum  tolerable  ABR  rate  and  the  rate  of  change  of  the 
VBR  rate.  While  the  negative  impact  of  a  decreased  network  time  scale  on  the  effectiveness  of 
this  control  scheme  is  well  established,  the  impact  of  the  ABR  and  VBR  time  scales  has  not  been 
investigated  in  the  past.  It  turns  out  that  for  a  given  network  time  scale,  the  induced  cell  losses  can 
be  significantly  reduced  for  increased  ABR  and/or  VBR  time  scales  and  thus,  the  latter  time  scales 
should  be  taken  into  consideration  when  evaluating  the  effectiveness  of  an  adaptive  feedback  based 
congestion  control  mechanism.  This  study  also  suggests  that  higher  efficiency  can  be  achieved  by 
enforcing  large  ABR  time  scales,  leading  to  the  introduction  of  a  new  class  of  transmission  policies. 
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1  Introduction 


Asynchronous  Transfer  Mode  (ATM)  has  been  chosen  as  the  CCITT  standard  for  the  switch¬ 
ing  and  multiplexing  technique  of  the  Broadband-Integrated  Services  Digital  Networks  (B-ISDN) 
currently  under  development.  ATM  networks  are  expected  to  support  applications  with  diverse 
traffic  characteristics  and  Quality  of  Service  (QoS)  requirements,  such  as  data  transfer,  video, 
voice,  multi-media  conference  and  real  time  control. 

ATM  is  primarily  a  connection  oriented  protocol.  Virtual  circuits  are  set  and  the  route  is 
decided  at  connection  set  up.  Connection  oriented  services  have  typically  long  sustained  sessions 
and  cells  are  transmitted  and  delivered  in  order.  Therefore,  it  appears  like  a  dedicated  link  where 
all  cells  associated  with  a  service  follow  the  same  path.  Statistical  multiplexing  is  possible  by  on- 
demand  transmitting  cells  from  different  sources  over  the  same  line  and  storing  them  temporarily 
in  shared  buffers  within  the  network. 

The  ATM  Forum  [1]  has  defined  four  different  service  classes.  Constant  Bit  Rate  (CBR), 
Variable  Bit  Rate  (VBR),  Available  Bit  Rate  (ABR)  and  Unspecified  Bit  Rate  (UBR).  These 
classes  can  cover  a  broad  spectrum  of  diverse  applications.  Non-real  time  VBR,  ABR  and  UBR 
service  classes  are  intended  for  non-real  time  applications,  while  CBR  and  real  time  VBR  are 
intended  for  real  time  applications. 

The  ABR  service  class  is  intended  for  non-real  time  applications  which  can  tolerate  delay.  This 
service  class  simply  uses  the  remaining  capacity  left  from  VBR  and  CBR  services  and  is  a  best 
effort  service.  The  goal  of  this  service  is  to  use  the  unused  network  bandwidth  as  efficiently  as 
possible.  For  ABR  the  only  negotiated  QoS  parameter  is  the  Cell  Loss  Ratio  (CLR)  and  is  not 
necessarily  quantified.  Peak  Cell  Rate  (PCR).  CDVT  and  Minimum  Cell  Rate  (MCR)  are  the 
traffic  parameters. 

The  ABR  service  class  is  different  from  the  other  service  classes  in  many  ways  [2]  .  The 
available  bandwidth  will  be  shared  by  the  ABR  connections  and  will  change  dynamically  due  to 
the  fluctuations  in  the  VBR  and  CBR  traffic  and  it  may  reduce  to  the  MCR.  Since  VBR  and  CBR 
traffic  rates  will  fluctuate  in  time,  the  ABR  sources  should  adjust  their  rates  to  these  fluctuations 
in  order  to  utilize  the  entire  remaining  capacity  or  avoid  cell  losses. 

These  characteristics  point  to  the  necessity  of  controlling  the  flow  of  the  ABR  traffic  sources. 
Two  major  flow  control  mechanisms  for  the  ABR  service  have  been  considered:  the  rate  based 
control  [3],  [4],  [5]  and  the  credit  based  control  [6],  [7],  [8].  Since  both  of  these  control  schemes  have 
advantages  and  disadvantages,  there  were  also  studies  on  the  integration  [9]  of  both  schemes. 

The  credit  based  flow  control  is  implemented  link  by  link  and  per  virtual  circuit  (VC).  Before 
transmitting  any  cells  over  the  link,  the  sender  associated  with  the  VC  link  needs  to  receive  credits 
for  the  VC  from  the  receiver.  The  credit  indicates  the  amount  of  free  buffer  space  available  for  the 
sender.  The  sender  can  not  transmit  more  cells  than  its  credit.  If  it  uses  all  its  credits,  it  must 
wait  for  other  credits.  Because  of  this,  credit  based  flow  control  ensures  zero  cell  loss. 

In  rate  based  flow  control,  the  sender  is  notified  about  congestion  in  the  network  with  a 
feedback  generated  from  the  receiver  or  from  the  congested  nodes.  When  the  sender  receives  such  a 
feedback,  it  adjusts  its  rate  according  to  the  feedback  control  policy  in  effect.  Although  the  credit 
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based  flow  control  can  handle  both  smooth  and  bursty  traffic  (while  the  rate  based  flow  control 
may  not  be  very  effective  for  bursty  traffic),  it  introduces  an  excessive  amount  of  overhead  and  it 
would  be  very  expensive  for  a  wide  area  network.  For  such  reasons,  the  more  flexible  rate  based 
flow  control  has  been  adopted  by  the  ATM  Forum. 

Many  different  traffic  control  policies  have  been  proposed  for  the  rate  based  flow  control  such  as 
the  Explicit  Forward  Congestion  Indication  (EFCI)  [10],  Backward  Explicit  Congestion  Notification 
(BECN)  [11],  Proportional  Rate  Control  Algorithm  (PRCA)  [12],  Explicit  Rate  Feedback  [13]  and 
Enhanced  Proportional  Rate  Control  Algorithm  (EPRCA)  [14], 

If  the  propagation  delay  is  non-zero,  the  ABR  sources  will  learn  about  the  change  in  the 
rate  of  the  VBR/CBR  sources  -  that  is,  the  current  level  of  the  available  bandwidth  -  only  after 
they  have  received  the  feedback  information.  Meanwhile,  the  ABR  sources  will  transmit  at  a  rate 
which  is  based  on  the  most  recent  level  of  available  bandwidth  known  to  them.  Consequently,  the 
effectiveness  of  the  rate  based  flow  control  schemes  will  decrease  under  non-zero  propagation  delays. 

For  example,  if  the  propagation  delay  between  an  ABR  source  and  the  network  access  node  is 
Td  and  the  available  bandwidth  changes  at  time  t ,  the  ABR  source  will  learn  about  the  change  in 
bandwidth  availability  at  time  t+Td .  The  adjusted  rate  from  the  ABR  source  will  reach  the  network 
access  node  at  time  t  +  2Td,  assuming  that  the  backward  and  forward  propagation  delays  are  equal. 
This  can  be  clearly  observed  in  Figure  1  where  a  network  link  is  shared  by  CBR,  VBR  and  ABR 
applications.  Whenever  the  VBR/CBR  sources  change  their  rates,  a  bandwidth  mismatch  occurs 
for  a  duration  of  2 Td- 


Figure  1:  An  example  of  bandwidth  mismatch 

Wrhen  a  decrease  in  the  remaining  bandwidth  occurs,  the  link  capacity  is  exceeded  for  a  duration 
of  2 Td  and  overutilization  occurs.  During  an  overutilization  period,  the  number  of  cells  in  transit 
becomes  excessive  and  this  causes  cell  losses.  When  an  increase  in  the  remaining  bandwidth  occurs, 
underutilization  occurs  for  a  duration  of  2 Td-  While  the  effect  of  increased  propagation  delay  on 
the  flow  control  schemes  has  been  studied  in  the  past  [15],  [16],  [17],  almost  no  effort  [18]  has  been 
focused  on  the  study  of  the  impact  of  other  system  parameters.  In  this  paper,  the  impact  of  these 
system  parameters  (network,  ABR  and  VBR  time  scales)  on  the  feedback  based  flow  control  is 
investigated. 

The  network  time  scale  is  defined  as  the  transmission  time  of  a  cell  and  it  decreases  as  the 
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network  speed  increases.  Therefore,  decreased  network  time  scale  corresponds  to  increased  prop¬ 
agation  delay.  The  VBR  time  scale  can  be  measured  from  the  rate  of  change  of  the  VBR  source 
transmission  rate  and  the  ABR  time  scale  is  defined  as  the  minimum  distance  among  subsequent 
blocks  of  subsequent  cells  generated  by  the  ABR  source.  These  time  scales  are  defined  in  detail 
later. 

The  remainder  of  the  paper  is  organized  as  follows.  In  section  2,  the  system  considered  in 
order  to  evaluate  the  impact  of  time  scales  is  described,  In  section  3,  a  suitable  Markov  Chain  for 
the  system  is  formulated  and  some  numerical  results  are  presented  in  section  4.  The  paper  ends 
with  the  conclusions  made  in  section  5. 

2  Description  of  the  System 

Consider  a  transmission  link  shared  by  Variable  Bit  Rate  (VBR)  and  Available  Bit  Rate  (ABR) 
applications.  The  VBR  sources  are  provided  prioritized  access  to  the  resources  and  ABR  sources  are 
allocated  the  remaining  resources.  In  order  to  simplify  the  analysis  and  facilitate  the  understanding 
of  the  impact  of  the  various  time  scales,  a  system  with  one  VBR  source  and  one  ABR  source  is 
considered.  Each  of  these  sources  will  deliver  at  most  one  cell  per  slot  according  to  the  rate  in 
effect. 

The  ABR  traffic  source  is  assumed  to  be  away  from  the  network  access  node  and  thus  the 
propagation  delay  between  the  ABR  source  and  the  network  access  node  is  nonnegligible.  The 
distance  between  the  VBR  source  and  the  network  access  node  is  not  important  since  no  control  is 
applied  on  the  VBR  source.  The  ABR  source  is  controllable.  Its  transmission  rates  are  calculated 
by  the  network  access  node  by  taking  into  consideration  the  current  VBR  transmission  rates.  The 
ABR  source  rates  should  be  calculated  appropriately  in  order  not  to  cause  instability.  Therefore, 
the  total  ABR  and  VBR  source  rates  at  any  time  instant  should  not  exceed  the  link  capacity. 

The  network  access  node  is  supposed  to  be  able  to  detect  a  change  in  the  VBR  source  rates. 
This  may  be  possible  through  an  explicit  indication  that  the  VBR  source  carries  by  certain  cells 
or  an  estimation  mechanism  implemented  at  the  network  access  node.  When  a  rate  change  for  the 
VBR  sources  occur,  the  network  access  node  detects  it  and  calculates  the  appropriate  transmission 
rates  for  the  ABR  sources.  Then,  this  information  is  fed  back  to  the  ABR  sources. 

Because  of  the  nonzero  distance  d  between  the  ABR  traffic  sources  and  the  network  access 
node,  bandwidth  mismatch  will  occur  when  VBR  traffic  rate  changes  occur.  A  change  in  the  VBR 
rate  at  time  t  will  be  detected  by  the  ABR  source  at  time  t  +  Td  and  the  adjusted  ABR  rate  will 
reach  the  network  access  node  at  time  t  +  2Td-  During  the  above  round-trip  propagation  delay  2 7k, 
the  network  may  either  be  underutilized  or  overutilized;  overutilization  is  the  main  reason  causing 
cell  losses. 

The  network  access  node  is  assumed  to  have  a  finite  buffer  of  capacity  C  for  the  temporary 
storage  of  the  ABR  cells;  a  buffer  of  capacity  1  is  sufficient  for  the  temporary  storage  of  the  VBR 
cells,  since  the  one  service  per  slot  will  be  available  to  the  VBR  cell  upon  its  arrival  to  the  network 
access  node. 
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A  queuing  model  of  the  network  access  node  is  shown  in  Figure  2.  The  service  policy  is  assumed 
to  be  work  conserving  and  VBR  cell  arrivals  are  provided  head  of  line  ( HoL )  priority  as  indicated 
above.  ABR  cells  are  served  according  to  the  first  come  first  served  (FCFS)  policy.  Cell  departures 
are  assumed  to  occur  before  cell  arrivals  occurring  over  the  same  slot. 


Network  access  node 


Figure  2:  The  queuing  model  of  the  network  access  node 


2.1  The  ABR  Traffic  Source 

It  is  assumed  that  the  ABR  source  can  transmit  at  the  maximum  allowable  rate  selected  for  it  by 
the  network  access  node.  Thus,  a  heavy  traffic  assumption  is  made.  Under  this  assumption  the 
ABR  source  will  always  have  a  cell  to  transmit  with  its  allowed  rate.  Therefore,  the  worst  case 
performance  of  the  ABR  source  can  be  derived.  The  maximum  allowable  ABR  source  rate  can 
be  calculated  from  the  current  VBR  source  rate  by  the  network  access  node.  For  example,  if  the 
current  VBR  source  rate  is  equal  to  r„,  then  the  maximum  allowable  ABR  source  rate  will  be  equal 
to  1  -  rv  -  f  resulting  in  a  total  link  rate  equal  to  1  -  e.  Here,  c  is  an  arbitrarily  small  positive 
number  to  ensure  that  the  maximum  system  load  is  below  1  (stability  condition). 

In  order  to  satisfy  this  rate,  the  ABR  source  can  transmit  one  cell  per  T  slots  where  T  = 

[•  l—]  and  [()!  denotes  the  smallest  integer  larger  than  the  ().  T  is  called  the  fundamental  ABR 

subframe.  The  same  rate  can  also  be  achieved,  if  the  ABR  source  transmits  n  cells  per  subframe 
of  length  nT  slots,  that  is,  if  the  source  employs  an  n-order  subframe. 

Let  tk  denote  the  beginning  of  the  kth  subframe  of  length  BkTk.  Tk  is  equal  to  the  fundamental 
subframe  which  is  equal  to  [ } _r^  and  rVk  is  the  effective  VBR  source  rate  at  tk\  Bk  ( Bk  £  N), 
denotes  the  length  of  the  batch  transmitted  over  the  kth  subframe.  If  Bk  =  1,  the  ABR  source 
will  employ  the  fundamental  subframe  Tk  to  transmit  at  a  rate  equal  to  1/Tfc.  If  Bk  >  1,  the  ABR 
source  will  employ  a  B^-order  subframe  to  transmit  at  the  same  rate  equal  to  l/Tk.  Note  that, 
the  ABR  source  time  scale  is  equal  to  the  length  of  the  Bk-order  subframe  ( BkTk ).  Figure  3  shows 
examples  of  transmission  employing  fundamental  or  higher-order  subframes. 

Let  TC(n,  nT)  denote  a  transmission  control  policy  according  to  which  the  ABR  source  trans¬ 
mits  a  batch  of  size  n  at  the  beginning  of  a  subframe  of  length  nT .  Let  TC(T)  =  {TC(n,nT),n  >  1} 
denote  the  class  of  all  TC{n,  nT)  policies  which  implement  the  same  transmission  rate  of  1/T.  This 
class  of  policies  is  characterized  by  the  fundamental  subframe  of  length  T. 
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At  a  subframe  boundary  tk  a  fundamental  subframe  7*  (and  ABR  rate)  will  be  determined 
based  on  the  current  VBR  rate  rVk.  Any  policy  in  TC(7*)  can  be  selected  for  the  implementation  of 
the  ABR  rate  of  1  /T*.  One  of  the  objectives  of  this  paper  is  to  study  the  performance  of  the  various 
policies  within  a  given  class  (the  different  ABR  time  scales)  and  provide  guidelines  regarding  the 
selection  of  the  optimal  one. 


B^2 


Et  =k 

Figure  3:  The  ABR  traffic  subframes  for  a  transmission  rate  of  1/7* 


2.2  The  VBR  Traffic  Source 

Although  the  activity  level  of  the  VBR  traffic  source  is  not  controllable  arid  can  change  in  principle 
at  any  slot,  it  will  be  assumed  that  such  changes  occur  only  at  subframe  boundaries  for  analysis 
tractability.  Therefore,  for  the  VBR  source,  time  is  considered  to  be  subframed  and  VBR  source 
rate  changes  can  occur  anywhere  in  this  subframed  time. 

Let  {5*}*> i  be  a  2-state  underlying  Markov  chain  with  state  space  S  =  {0,1}-  {£*}*> i  will 
be  used  to  describe  the  VBR  source  activity  level.  j  is  embedded  at  subframe  boundaries 

{/*}*>]  of  length  7J*7*.  where  7*  is  determined  by  the  perceived  remaining  capacity  as  shown  in 
the  previous  subsection. 

Let  rv(, s*)  denote  the  rate  of  the  VBR  source  at  the  kth  subframe,  s*  E  5.  Without  loss  of 
generality,  it  is  assumed  that  r„(l)  >  rl/(0).  The  VBR  cell  arrival  process  is  modeled  as  a  Markov 
Modulated  Bernoulli  process.  The  probability  of  transmitting  1(0)  cell  in  a  slot  of  the  kth  subframe 
is  rv(sk)(  1  -  rv(sk)). 

Let  ps(sk)  denote  the  probability  that  5*  changes  at  the  end  of  the  kth  subframe  at  the  begin¬ 
ning  of  which  it  was  in  state  s*.  ps{sk)  will  reflect  the  average  time  over  which  VBR  applications 
maintain  a  constant  rate  (or  VBR  time  scale).  If  ps(0)  =  ps(l)  =  ps ,  then  the  VBR  time  scale  will 
be  equal  to  1/p*;  otherwise,  two  time  scales  may  be  defined. 

3  Analysis  of  the  System 

In  this  section  a  proper  Markov  Chain  describing  the  evolution  of  the  system  is  described  by  taking 
into  consideration  the  properties  of  the  ABR  and  VBR  sources  described  in  the  previous  section. 
Then,  a  numerical  approach  is  developed  for  the  evaluation  of  the  loss  performance  of  the  system. 
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Since,  no  VBR  cell  losses  will  occur,  the  analysis  is  focused  on  the  evaluation  of  the  loss  performance 
of  the  ABR  source. 

Let  {Sk,Qk}k>i  be  a  2-dimensional  process  embedded  on  {4}fc>i?  where  Qk  is  a  random 
variable  describing  the  buffer  occupancy  at  tk,  0  <  Qk  <  C .  By  definition,  Sk  determines  completely 
the  interfering  or  the  VBR  arrival  process. 

If  the  propagation  delay  Tj  is  negligible,  then  Sk  determines  the  ABR  cell  arrival  process  as 
well,  because  the  ABR  source  will  respond  to  the  VBR  source  rate  changes  immediately.  If  the 
propagation  delay  is  nonnegligible  -  as  it  is  considered  to  be  the  case  in  this  work  -  the  fundamental 

ABR  subframe  length  can  be  calculated  from  Tk  =  - t-tt — 1  where  si  is  the  VBR  state  2 Td  time 

units  earlier.  As  a  consequence,  the  current  state  Sk  does  not  determine  the  arrival  process  and 
thus  {Sk,Qk}k> l  is  not  a  Markov  chain. 

Although  a  Markov  chain  can  be  constructed  to  describe  the  evolution  of  the  system  under  fixed 
and  non-zero  roundtrip  propagation  delays,  a  simplifying  assumption  is  made  by  assuming  that  the 
roundtrip  propagation  delay  is  random  and  geometrically  distributed  with  mean  —  measured  in 
terms  of  subframes.  Given  that  a  VBR  source  rate  change  has  occured,  a  feedback  indicating 
the  VBR  source  rate  change  will  arrive  at  the  ABR  source  by  the  next  subframe  boundary  with 
probability  pj.  Note  that,  pj  is  adjusted  based  on  the  current  subframe  length  so  that  the  mean 
propagation  delay  under  both  subframe  schemes  are  equal. 

In  order  to  determine  the  arrival  process  to  the  node  over  the  kth  subframe,  the  network  access 
node  needs  to  know  the  current  VBR  source  rate  ( rv(sk ))  and  whether  the  current  ABR  rate  is 
based  on  the  most  recent  feedback  sent  by  the  node  or  not  (that  is,  whether  the  feedback’s  impact  is 
not  pending  or  pending,  respectively).  Therefore,  another  random  variable  is  introduced  describing 
the  status  of  the  impact  of  the  feedback  at  t*.. 

Let  Jk  be  an  indicator  function  which  assumes  the  value  of  1,  if  the  impact  of  a  feedback 
carrying  a  VBR  source  rate  change  is  pending  at  tk ;  that  is,  if  the  ABR  rate  arriving  at  the  node 
is  not  adjusted  yet.  If  the  impact  of  the  feedback  is  not  pending,  Jk  is  equal  to  0.  The  ABR  traffic 
rate  y~  will  be  completely  determined  by  (Sk,Jk)  and  it  is  given  by  ^  =  1  —  rv(sk  ®  jk)  —  £  where 
•r  denotes  the  modulo  2  addition;  i  ®  j  will  be  0(1)  if  i  =  j(i  ^  j ). 

Although  it  is  possible  that  at  tk  the  impact  of  more  than  one  feedbacks  is  pending,  at  most 
one  pending  feedback  will  be  considered  in  order  to  simplify  the  analysis.  This  approximation 
holds  true  if  maxSk  {ps{sk)}  pj\  that  is,  if  the  VBR  source  time  scale  is  much  larger  than  the 
propagation  delay. 

Under  the  above  assumptions,  it  is  easy  to  establish  that  the  stochastic  process  Jk,  Qk} 
embedded  at  subframe  boundaries  is  a  Markov  chain  with  state  space  {(sk,  jk^Qk)  :  0  <  s*  <  1,0  < 
Jk<hO<qk<C). 

Let  P{skjkQk<  $k+\jk+\ qk+\ )  be  the  probability  that  the  Markov  chain  {Sk,  Jk ,  Qk)  moves  from 
state  (sk, jk, Qk)  at  time  tk ,  to  state  (sk+\,jk+i,qk+i)  at  time  4+1  •  From  the  description  of  the 
system  model,  it  can  be  understood  that  (sfc+i,Jfc+i)  is  independent  from  the  queue  occupancy  qk 
and  thus  P(skjkQkm,  &k+\  jk+iQk+i )  can  be  expressed  as  follows. 

P{skjkgk;sk+\jk+\qk+\)  =  P($kjk;  sk+\jk+i)P(qk+\/skjkqk) 


6 


The  probabilities  on  the  right  side  of  the  above  equation  can  be  calculated  separately  as  shown 
below.  Note  that  P{skjk]  8k+ijk+i)  ls  the  probability  of  passing  from  (s^jk)  to  (sk+\,jk+i)  and 
P(qk+\  / SkjkQk)  is  the  probability  that  the  queue  occupancy  at  the  beginning  of  the  next  subframe 
is  qk+ 1,  given  the  current  subframe  state  (skjkqk)- 

The  state  diagram  for  the  {5/;,  Jjt}  process  is  shown  in  Figure  4.  Notice  that  not  all  state 
transitions  are  possible  because  of  the  constraint  on  jk  and  the  requirement  of  generating  a  feedback 
only  when  the  VBR  rate  changes.  Let  (skjk)  (sjt+iifc+i)  denote  a  state  transition  from  (&kjk)  to 
(sk+\  jk+\  )*  For  example,  a  (00)  -+  (10)  transition  is  impossible,  because  a  pending  feedback  has  to 
be  generated  when  a  VBR  rate  change  occurs.  This  transition  may  be  possible  only  if  the  feedback 
delay  is  zero.  Similarly,  a  transition  from  (11)  — *  (01)  can  occur  with  a  very  low  probability  and 
it  is  assumed  to  be  equal  to  0.  This  result  comes  from  the  assumption  that  the  VBR  source  time 
scale  is  much  larger  than  the  propagation  delay. 
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Let  qk,i  denote  the  queue  occupancy  at  the  end  of  the  ith  slot  of  the  kth  subframe  and 
PA(<lk,i+\/skjkqk,i)  and  PB(Qk,i+\/skjkqk,i )  denote  the  one  slot  transition  probabilities  from  qk,i 
to  qk.i+\  over  the  first  Bk  slots  of  the  kth  subframe  (where  both  ABR  and  VBR  cell  arrivals  are 
possible)  and  the  remaining  BkTk-Bk  slots  (where  only  VBR  cell  arrivals  are  possible);  respectively. 
These  probabilities  are  equal  to, 


PA(qk,i+i/siJkqk.i)  = 


P{A(sk)  =  qk,i+ 1  —  1}  Qk,i  —  0,  0  <  qk<i+i  <  C  -  1 

<  P{A(sk)  =  qk,i+i  -  qk,i}  l  <  qk,i  <  C,  o  <  g*,,+i  <  C  - 1 

,  El;Co+,‘"  P{A(sk)  =  C-qk<i  +  m}  0  <  qk>i  <  C,  0  <  qk,i+l  <  C. 

where  A(s/t)  denotes  the  number  of  cell  arrivals  from  the  VBR  traffic  source  in  a  slot  when  the 
current  VBR  state  is  sk.  Similarly, 


PB(qk,i+i/skjkqk,i) 


P { A(sk )  —  qk ,«+i }  qk,i  —  0)  0  ^  9fc,i+i  ^  C 

P{A(sk)  =  qk,i+\  ~  qk,i  +1}  1  <  qk,i  <  C,0  <  qk,i+\  <  C. 


Combining  these  two  equations  P(qk+\ /$kjkqk)  can  be  expressed  as  follows, 

c 

P(qk+\/skjkqk)  =  PA(qk,Bjskjkqk)PB{qk+\/skjkqk,B>,)- 

9*.Bk=0 


The  Bk  slot  transition  probability  PA{qk.Bk/skjkqk )  and  BkTk  —  Bk  slot  transition  probability 
P[j{qk+)  /*kjkqk.Bk )  can  be  calculated  iteratively  using  the  1  slot  transition  probabilities. 

After  the  calculation  of  the  transition  probabilities  P{skjkqk;  Sk+ijk+iqk+i),  the  steady  state 
probability  distribution  II {$k,jk-,qk)  can  be  derived  from  the  following  matrix  equations, 

II  =  UP 

X>  =  i 

l 

where  P  is  the  state  transition  matrix  and  each  of  its  elements  show  the  transition  probabilities 

P(*kjk<lk :  *k+\jk+\qk+i  )• 

In  order  to  study  the  impact  of  network  and  source  time  scales,  the  ABR  cell  loss  probability 
is  evaluated.  The  ABR  cell  loss  probability  can  be  calculated  using  the  average  number  of  cells 
arrived  at  a  subframe  in  which  {Sk,  Jk- Qk}k>\  is  in  state  ( sk,jk,qk )  and  taking  the  expectation 
over  all  possible  states.  Therefore,  the  ABR  cell  loss  probability  denoted  by  LP  is  given  as, 


LP  =  E{ 


Average  number  of  cells  lost  in  (*kjk<jt() 

Average  number  of  cells  arrived  in  state  ( SkjkQk ) 


Let  L(sk,jk.qk)  and  M(sk,jk^  Qk)  denote  the  average  number  of  cells  lost  and  arrived  over  the 
subframe  at  the  beginning  of  which  the  process  {Sk,Jk^Qk}k> l  is  in  state  ($*,  jk,qk),  respectively. 
Then, 

x  r>  tt  (  -^J($k'i  jki  qk)  ^ 

LF  =  Mtt7 - : - :/• 

Al{sk,jk,qk) 

In  order  to  calculate  L{sk*jk,qk)  and  M{sk,jk,qk),  /($*, 3k,  qk’,  i)  and  R(sk,  jk‘,  i)  can  be  intro¬ 
duced.  Let  I{sk,jk-.qk\i){R{sk',jk-.qk\i))  denote  the  number  of  cells  lost(arrived)  at  the  ith  slot  of 
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the  subframe  at  the  beginning  of  which  the  system  is  in  state  (skjkqk)',  1  <  *  <  BkTk-  R(sk,jk,  <}k\ *) 
does  not  depend  on  the  queue  occupancy  and  is  a  function  of  Sk,  jk  and  i  since  Tk  =  f(sk,jk)- 
Therefore, 


R(sk,jk\  i) 


A(sk)  if  i  >  Bk 
A(sk)  +  1  if  i  <  Bk ■ 


The  number  of  lost  cells  can  be  found  by  considering  the  number  of  arrivals,  the  buffer  occu¬ 
pancy  in  the  previous  slot  and  the  buffer  capacity  C.  Therefore, 


I(sk,jk,qk‘,i) 


m  if  R(sk,jk ;  i)  =  C  +  1  ~.qk,i- 1  +  m 

0  otherwise  . 


Clearly,  the  number  of  cell  arrivals  over  a  subframe  at  the  beginning  of  which  the  system  is  in 
state  ( SkjkQk )  is  equal  to  the  summation  of  R(sk,jk ;  *)  for  all  slots  i  of  this  subframe.  Thus, 

BkTk 

M(sk-jk-qk)  =  X  R(skJk.qk) 

i  =  l 

=  t(A(sk)  +  l)+  I ;  A(sk) 

t=l 

=  Bk  +  BkTkA(sk). 


Taking  the  expectation  of  both  sides. 


M(*k-jk-qk)  =  E{Bk  +  BkTkA(sk)}  =  Bk  +  BkTkrv(sk). 


The  total  number  of  cells  lost  over  the  entire  subframe  is  equal  to  the  summation  of  I($k-jk ■  qk\  i) 
for  all  slots  i  of  this  subframe. 

BkTk 

L(sk-jk-qk)  =  X'<  Skjk,qk-i)- 

i=i 

Taking  the  expectation  of  both  sides, 

BkTk 

L(»kJk.qk)=  X  B{l(skJk,qk-i)}. 

l-\ 

The  above  equation  can  be  divided  into  two  terms,  the  first  term  corresponding  to  the  first 
part  of  the  subframe  (when  both  ABR  and  VBR  cell  arrivals  are  possible)  and  the  second  term 
corresponding  to  the  second  part  of  the  subframe  (when  only  VBR  cell  arrivals  are  possible).  Thus, 
_  Bk 

L(sk.jk-qk)  =  ^2E{I{sk,jk.qk-i)} 

t=i 

BkTk 

+  X  E{I(sk,jk,qk\i)}- 

i=Bk+ 1 
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By  employing  the  definitions  for  I(sk,jk,qk‘,i )  and  R(sk,  jk',i )  the  following  expression  is  ob¬ 
tained, 

_  B* 

L{sk,jk,qk)  =  EE™^"4^  =  C -qk,i-i  +  m/qk,sk,  jk} 

1  =  1  m 
BkTk 

+  E  ^2mP{A(sk)  =  C  +  1  -  qkii^i  +  m/qk,sk,jk}. 

t=£k+l  m 


Since  there  are  at  most  one  arrival  per  slot  after  the  first  B *  slots  of  each  subframe,  cell  loss 
may  occur  only  in  the  first  Bk  slots.  Therefore, 

__  Bk 

L{sk,jk,  qk)  =  J2J2mP{A(sk)  =  C  -  qk>i-i  +  m/qk,  sk,  jk} 

i~  1  m 

and 


Bk  c  l-C+l 

L(skJk^k)  =  EE  E  mP{A(sk)  =  C  -  /+  m/qk,skJk}P{qki_i  =  l/qk0  =  qk}. 

:  =  1  /  =  0  m  =  0 


Combining  L(sk,jk,qk)  and  M(sk,jk,Qk)  equations  obtained  above,  the  ABR  cell  loss  proba¬ 
bility,  LP ,  can  be  expressed  as  follows, 


LP  = 


x 

x 


1  1  c 


LEE 


R(sk,jk<qic) 

Bk  -f  rv(sk)BkTk 


Bk  c  i-c +1 

EE  E  mP{A(sk)  =  C  -  I  +  rn/qk,sk,jk} 

*=1  /=0  m  =  0 

P{<ik.i-i  -  l/qk.o  =  qk}- 


4  Numerical  Results 

In  this  section,  some  numerical  results  are  presented  to  illustrate  the  impact  of  network  and  source 
time  scales  on  the  performance  of  the  ABR  source.  While  a  large  propagation  delay  is  expected  to 
impact  negatively  on  the  performance  of  the  system,  the  opposite  impact  is  expected  from  large 
ABR  and  VBR  time  scales. 

The  performance  of  the  queuing  system  in  terms  of  cell  loss  probability  for  the  ABR  source 
is  illustrated  in  Figure  5.  The  loss  probability  is  plotted  as  a  function  of  pj  and  the  results  are 
derived  for  a  buffer  capacity  C  of  40  and  VBR  source  rates  of  0.4  and  0.8  cells/slot.  For  this  plot 
the  TC(l.Tjt)  transmission  policy  is  used  for  the  ABR  source.  As  the  propagation  delay  decreases 
(pj  increases),  the  amount  of  time  that  it  takes  for  the  ABR  source  to  respond  to  the  VBR  source 
rate  changes  decreases.  As  a  consequence,  the  duration  of  bandwidth  mismatch  decreases,  causing 
a  decrease  in  the  ABR  cell  loss  probability.  As  the  VBR  time  scale  decreases  ( ps  increases)  , 
VBR  source  rate  changes  occur  more  frequently.  As  a  result,  more  bandwidth  mismatch  cycles 
are  initiated  resulting  in  more  losses.  Figure  5  shows  the  ABR  loss  probability  for  ps  =  0.005, 
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ps  =  0.001,  ps  =  0.0005,  ps  —  0.0001  where  ps  =  ps( 0)  =  ps(l).  Note  that  ps  is  kept  below  0.005 
due  to  the  requirement  that  it  should  be  much  less  than  pj  in  order  for  the  assumption  of  no 
multiple  feedbacks  to  be  reasonably  accurate. 

In  Figures  6  and  7,  the  impact  of  the  ABR  time  scale  is  illustrated.  The  ABR  loss  probability 
is  plotted  against  the  batch  size  for  various  values  of  buffer  capacities.  The  VBR  rates  are  kept 
equal  to  0.4  and  0.8  cells/slot.  The  average  distance  between  the  ABR  source  and  the  network 
access  node  is  assumed  to  be  10000  subframes  (thus  pj  —  10-4).  Note  that,  the  ABR  time  scale 
(subframe  lengths)  is  directly  proportional  to  batch  sizes,  since  subframe  lengths  are  equal  to 
BkTk .  For  example,  for  this  particular  example  a  batch  size  of  7  corresponds  to  subframe  lengths 
of  14  and  42,  where  the  corresponding  fundamental  subframe  lengths  are  2  and  6.  Because  of 
the  single  outstanding  feedback  assumption,  ps  is  kept  much  smaller  than  pj  ( ps  -  10“6).  The 
ABR  loss  probability  vs.  the  batch  size  is  plotted  in  Figure  6  for  C  -  10,15,20,25,30.  It  can  be 
clearly  observed  that  for  a  fixed  value  of  C,  the  loss  probability  initially  decreases  as  the  batch  size 
increases.  This  behavior  is  reversed  when  the  batch  size  exceeds  a  threshold.  Thus,  for  a  given 
buffer  capacity  C,  there  is  an  optimal  batch  size  that  minimizes  the  induced  ABR  cell  losses.  These 
observations  can  be  made  in  Figure  7  where  results  for  a  larger  range  of  batch  sizes  are  plotted  for 

C  =  10.20. 

The  reduced  ABR  losses  for  batch  sizes  greater  than  one  may  be  attributed  to  the  associated 
increased  ABR  source  time  scale.  This  positive  impact  prevails  as  long  as  the  buffer  capacity  C  is 
large  enough  to  absorb  the  increased  batches.  When  the  batch  size  exceeds  a  threshold,  it  cannot 
be  effectively  absorbed  by  the  fixed  buffer  size,  resulting  in  losses  which  are  not  compensated  for 
by  the  benefits  from  the  increased  time  scale.  The  larger  the  value  of  C,  the  larger  this  threshold 
is  expected  to  be.  as  it  is  clearly  observed  in  Figures  6  and  7. 

The  impact  of  the  employed  ABR  time  scale  (or  batch  size)  on  the  system  performance  is 
expected  to  be  affected  by  the  propagation  delays.  Figures  8  and  9  illustrate  the  impact  of  the 
propagation  delay  on  the  performance  of  the  system.  Figure  8  is  obtained  for  an  average  distance 
between  the  ABR  source  and  the  network  access  node  equal  to  1000  subframes  (thus  pj  =  10~3). 
Figure  9  is  obtained  for  an  average  distance  of  100  subframes  (thus  pj  =  10-2).  As  expected,  the 
performance  improves  under  all  ABR  time  scales  as  the  distance  decreases,  because  the  amount  of 
bandwidth  mismatch  is  proportional  to  the  distance. 

By  comparing  the  plots  for  C  =  10  in  Figures  6.  8.  9  which  have  been  obtained  for  an  average 
propagation  distance  of  20000.  2000  and  200  slots,  respectively,  it  may  be  concluded  that  the 
optimal  batch  size  decreases  with  the  propagation  distance.  For  a  propagation  distance  of  200  slots 
and  C  =  10  the  optimal  batch  size  is  as  low  as  1.  This  trend  may  be  attributed  to  the  decreasing 
positive  impact  of  a  large  ABR  time  scale  (batch  size)  as  the  propagation  delay  decreases.  This 
positive  impact  cannot  compensate  for  the  negative  impact  of  a  larger  batch  size,  driving  the 
optimal  batch  size  to  lower  values. 

The  above  can  be  more  clearly  observed  in  Figure  10  where  the  optimal  batch  size  against  the 
buffer  capacity  is  plotted  for  average  distances  of  20000  and  2000  slots.  It  can  be  established  that  for 
large  propagation  distances,  the  optimal  batch  size  is  larger  than  1  even  for  small  buffer  capacities. 
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For  small  propagation  distances  and  small  buffer  capacities,  better  performance  is  achieved.  Note 
that  the  optimal  batch  size  seems  to  be  linearly  increasing  with  the  buffer  capacity. 

In  addition  to  the  network  and  source  time  scales,  the  VBR  source  rates  also  have  considerable 
effects  on  the  performance.  Figure  11  shows  the  affects  of  the  VBR  source  rates  on  the  loss 
probability.  For  a  meaningful  comparison,  the  mean  VBR  arrival  rate  is  kept  constant  for  the  three 
cases  considered.  The  VBR  source  rates  for  the  three  cases  are  0.9  and  0.3,  0.8  and  0.4.  0.7  and  0.5 
cells/slot.  It  can  be  clearly  observed  that  the  loss  performance  improves  as  the  difference  between 
the  two  rates  decreases.  For  the  case  with  rv(0)  =  0.3  and  rv(l)  =  0.9,  the  subframe  lengths  are  2 
and  11  slots.  These  subframe  lengths  correspond  to  ABR  source  rates  of  0.5  and  0.091  respectively. 
Therefore,  during  a  bandwidth  mismatch  initiated  by  a  VBR  source  rate  change,  the  total  rate  on 
the  link  can  increase  up  to  1.4  causing  potentially  high  losses.  When  the  VBR  source  rate  drops 
from  0.9  to  0.3,  the  total  rate  in  the  link  decreases  to  0.391  causing  excessive  underutilization. 
However,  when  VBR  rates  are  0.4  and  0.8,  the  corresponding  ABR  rates  will  be  0.5  and  0.167. 
During  a  bandwidth  mismatch  the  total  rate  on  the  link  will  be  at  most  1.3  or  at  least  0.567. 
Compared  to  1.4  and  0.391,  the  overutilization  and  underutilization  in  this  case  are  decreased. 
Therefore,  a  decrease  in  the  ABR  loss  probabilities  occurs. 

5  Conclusions 

In  this  paper,  the  impact  of  network  and  source  time  scales  on  the  performance  of  ABR  applications 
is  studied.  The  network  time  scale  is  defined  as  the  transmission  time  of  a  cell.  The  VBR  time 
scale  is  defined  in  terms  of  the  rate  of  change  of  the  VBR  source  rate.  The  ABR  time  scale  is 
defined  to  he  the  minimum  distance  between  consecutive  blocks  of  consecutive  cells  generated  by 
the  ABR  source.  In  order  to  evaluate  the  impact  of  these  time  scales,  a  system  with  one  ABR  and 
one  VBR  source  is  considered.  (A  system  with  multiple  ABR  sources  also  gave  similar  results.) 

While  the  impact  of  the  network  time  scale  has  been  considered  in  detail  in  the  past,  that  of 
other  relevant  time  scales  has  not  been  investigated.  The  main  contribution  of  this  paper  is  the 
study  of  the  impact  of  the  VBR  and  ABR  time  scales  on  feedback  based  flow  control. 

The  increased  VBR  time  scale  has  a  positive  impact  on  the  ABR  loss  performance.  This  is 
due  to  the  fact  that,  less  frequent  VBR  rate  changes  (or  bandwidth  mismatch  cycles)  occur  when 
the  VBR  time  scale  increases,  resulting  in  reduced  ABR  cell  losses.  Increasing  the  ABR  time  scale 
(or  the  batch  size),  initially  decreases  the  loss  probability.  After  the  batch  size  exceeds  a  threshold, 
the  situation  is  reversed.  The  optimal  batch  size  -defined  to  be  the  one  corresponding  to  this 
threshold-  increases  with  the  buffer  capacity,  as  the  capability  of  absorbing  larger  batches  by  the 
buffer  increases  with  increased  buffer  space.  For  large  propagation  distances  the  optimal  batch  size 
is  greater  than  one,  even  for  small  buffer  capacities.  As  the  propagation  distance  decreases,  the 
optimal  batch  size  decreases  as  well.  This  may  be  due  to  the  reduced  positive  impact  of  the  ABR 
time  scale  for  decreased  propagation  delay. 

The  major  conclusion  of  this  work  is  that  in  addition  to  the  network  time  scale,  the  ABR  and 
VBR  time  scales  may  impact  substantially  a  feedback  based  flow  control  as  well.  Thus  the  impact 
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of  all  system  time  scales  should  be  considered  in  order  to  accurately  evaluate  the  effectiveness  of 
a  flow  control  algorithm.  Also,  based  on  this  study  a  new  class  of  transmission  polices  for  the 
ABR  sources  is  defined  which  may  result  in  better  system  efficiency.  Thus,  departing  from  the 
periodic  Bk  —  1  policy,  Bk  =  n  (n  >  1)  policies  may  be  more  beneficial.  Further  study  will  focus 
on  development  of  a  new  framework  for  ABR  traffic  management  based  on  this  new  policy. 
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Figure  6:  ABR  cell  loss  probability  vs.  batch  size  for  various  values  of 


Figure  7:  ABR  cell  loss  probability  vs.  batch  size  for  C  —  10  and  C  = 
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Ceil  Loss  Probability 


Figure  10:  Optimum  batch  size  vs.  buffer  capacity 


Figure  11:  ABR  cell  loss  probability  vs.  ps  for  various  VBR  arrival  rates 
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Abstract 

In  Wireless  ATM  networks  transmission  resources  are  shared  among  geographically  disperse 
applications  with  diverse  Quality  of  Service  (QoS)  requirements  and  traffic  characteristics.  To 
provide  QoS  guarantees  and  use  the  bandwidth  efficiently,  call  admission  and  scheduling  func¬ 
tions  are  necessary.  These  functions  should  ensure  the  delivery  of  the  target  QoS  to  the  sup¬ 
ported  applications  while  achieving  statistical  multiplexing  gains,  without  explicit  and  continu¬ 
ous  exchange 
In  this 

heterogeneous  Variable  Bit  Rate  (VBR)  applications  with  diverse  QoS  requirements  is  addressed. 
The  QoS  requirements  for  each  application  is  defined  in  terms  of  a  maximum  tolerable  packet 
delay  and  dropping  probability;  a  packet  is  dropped  if  it  experiences  excess  delay.  The  region 
of  achievable  QoS  vectors  is  established  for  policies  that  are  work-conserving  and  satisfy  the 
I  earliest  due  date  (EDD)  service  criterion  (WC-EDD  policies);  such  policies  are  known  to  optimize 
the  overall  system  performance.  In  addition  to  the  determination  of  the  region  of  achievable 
I  QoS  vectors,  this  study  leads  also  to  the  construction  of  scheduling  policies  which  deliver  any 
/  performance  in  the  region  established  for  WC-EDD  policies.  Finally,  an  upper  bound  on  the 

/  region  of  QoS  vectors  that  can  be  achieved  under  any  policy  (not  limited  to  the  WC-EDD 

_ policies)  is  determined. 
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1  Introduction 


The  next-generation  wireless  networks  are  expected  to  offer  a  range  of  services  to  support  inte¬ 
grated  applications  such  as,  voice,  video,  multi-media,  and  conventional  data.  In  addition,  this  new 
architecture  should  be  compatible  with  the  prevalent  network  architecture  for  integrated  services 
over  fiber/copper  based  channels,  the  Asynchronous  Transfer  Mode  (ATM).  It  would  only  seem 
natural  to  extend  the  ATM  protocol  into  the  wireless  environment,  but  this  environment  poses 
many  challenges  for  ATM,  such  as  an  unreliable  wireless  link  and  a  multi-access  up-link  channel. 
Recently,  system  architectures  have  been  proposed  to  enable  “Wireless  ATM”  (WATM)  [1.  2.  3]. 
These  systems  employ  a  Data  Link  Control  (DLC)  layer,  to  combat  the  unreliability  of  the  wireless 
link,  and  a  medium  access  control  (MAC)  protocol,  to  organize  the  sharing  of  the  multi-access 
channel  (wireless  up-link).  MACs  for  WATM  have  been  examined  in  [1,  4,  5.  6.  7]  and  all  employ 
TDMA  with  on  demand  assignment  of  the  transmission  resources.  Allocating  resources  on  demand 
for  Variable  Bit  Rate  (VBR)  sources  while  a  call  is  in  progress,  statistical  multiplexing  gains  are 
achieved,  results  in  efficient  use  of  the  available  bandwidth. 

Due  to  the  lack  of  resources  in  a  wireless  network,  complete  continuous  information  of  the  status 
of  tlio  distributed  source  packets  may  not  be  known.  The  information  needed  to  assign  transmissions 
may  only  be  updated  periodically  or  at  discrete  moments,  as  in  [1,  4,  5.  6,  7].  These  systems 
use  a  combination  of  control  channels,  piggy-backing  on  information  bearing  packets  and  polling 
procedures  to  communicate  service  request  and  transmission  assignments  between  the  scheduler 
and  source.  In  this  environment  a  MAC  protocol  should  determine  a  transmission  scheduling 
policy  in  a  way  that  the  probability  of  violating  the  maximum  tolerable  packet  delay  -  which  leads 
to  packet  dropping  rate  because  of  discarding  packets  with  excess  delay  -  does  not  exceed  the 
required  maximum  packet  dropping  probability.  At  the  same  time  a  call  admission  function  should 
ensure  that  the  requesting  service  from  all  of  the  supported  applications  is  possible  to  deliver.  The 
maximum  packet  delay  and  packet  dropping  probability  are  the  critical  QoS  parameters  for  real¬ 
time  oriented  services.  The  call  admission  and  scheduling  functions  ensure  the  delivery  of  target 
QoS  to  the  supported  applications  while  achieving  statistical  multiplexing  gains. 

In  this  paper,  call  admission  and  transmission  scheduling  policies  are  considered  for  a  TDMA 
system,  where  service  requests  are  processed  at  frame  boundaries.  This  resource  structure  has 
been  widely  considered  in  both  cellular  systems  [8]  and  wireless  LANs  [9],  as  well  as  in  recent 
work  toward  the  development  of  wireless  ATM  networks  [1].  Transmission  scheduling  policies  for 
this  environment  were  developed  in  [10]  to  accommodate  heterogeneous  Variable  Bit  Rate  (VBR) 
applications  with  diverse  packet  dropping  requirements.  In  that  work,  packets  which  are  not 
serviced  in  the  frame  following  their  arrival  are  considered  to  have  excess  delay  and  are  dropped. 
That  is,  all  packets  assume  a  common  maximum  tolerable  delay  of  one  frame.  In  the  present 
work,  the  packets  are  considered  to  have  diverse  maximum  delay  as  well  as  dropping  probability 
requirements.  As  it  becomes  apparent  in  this  paper,  the  added  diversity  regarding  the  maximum 
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delay  creates  a  number  of  issues.  Some  of  these  issues  are  addressed  by  considering  a  specific  family 
of  policies,  as  discussed  in  the  main  body  of  this  paper. 

In  the  next  section,  the  system  model  considered  in  this  work  is  described.  In  section  3,  the 
region  of  achievable  QoS  vectors  is  established  for  policies  that  are  work- conserving  and  satisfy  the 
earliest  due  date  (EDD)  service  criterion  (WC-EDD  policies).  Determining  this  region  is  central 
to  the  call  admission  control  problem  as  well  as  the  design  of  effective  transmission  scheduling 
algorithms.  In  section  4,  the  probability  distribution  of  the  residual  traffic  process  is  derived  for 
arrival  processes  modeled  in  terms  of  a  sequence  of  i.i.d.  random  variables.  Section  5  examines 
a  class  of  policies  that  can  deliver  any  achievable  WC-EDD  performance.  An  upper  bound  on 
the  region  of  QoS  vectors  that  can  be  achieved  under  any  policy  (not  limited  to  the  WC-EDD 
policies)  is  determined  in  section  6.  The  region  of  acceptable  QoS  vectors  is  examined  in  section 
7.  Numerical  examples  are  presented  in  section  8  and  the  conclusions  of  this  work  are  contained  in 
section  9. 

2  System  Model 

In  this  paper,  the  problem  of  sharing  T  resources  by  N  heterogeneous  Variable  Bit  Rate  (VBR) 
source  applications  is  considered.  The  source  packet  arrival  process  is  described  in  terms  of  a 
general  arrival  process  embedded  at  the  boundaries  of  fixed  length  intervals  called  service  cycles 
(or  frames).  No  additional  assumptions  for  the  packet  arrival  process  are  necessary  at  this  point. 
Up  to  T  packets  may  be  transmitted  (served)  during  each  service  cycle.  Depending  on  the  Quality 
of  Service  (QoS)  requirements,  packets  which  cannot  be  transmitted  over  the  service  cycle  following 
their  arrival  may  be  dropped  (due  to  delay  violation)  or  may  be  delayed  to  compete  for  service 
in  the  next  frame.  A  TDMA  system  in  which  arrivals  are  considered  at  frame  boundaries  may  be 
modeled  in  terms  of  a  discrete  time  system  in  which  packet  delays  are  measured  in  frames  (1  Frame 
=  L  time  units).  In  the  system  considered  in  this  work,  the  N  VBR  sources  are  partitioned  into 
two  classes.  S\  =  {1.2,....  A'}  and  S2  =  {A'  +  1,  K  +  2, . . . ,  N}.  Packets  generated  from  sources 
in  S\  have  a  common  maximum  delay  tolerance  of  L  time  units  (1  frame)  and  packets  generated 
from  sources  in  52  have  a  common  maximum  delay  tolerance  of  2 L  time  units  (2  frames). 

Let  A i(n).  i  £  {5i  U52}  denote  the  number  of  newly  generated  packets  requesting  service  from 
sources  in  S\  and  S2  at  the  nth  frame  boundary.  The  aggregate  traffic  from  sources  in  5]  is  given 
by  A 5j  ( ft )  =  A i(n),  and  has  a  delay  tolerance  of  L  time  units,  thus,  must  be  either  serviced 

or  dropped  over  frame  n.  Newly  generated  requests  from  sources  in  52-  A s2(n)  =  X^es2  ^(n)’  maY 
be  either  serviced  or  delayed  to  the  next  frame  to  compete  for  service.  Packets  from  52  that  have 
been  delayed  (and  must  be  serviced  or  dropped  in  the  current  frame)  are  denoted  as,  A (n)\  the 
superscript  r  is  used  to  indicate  residual  traffic  (those  packets  from  A 2(n  —  1),  i  E  52  not  serviced) 
and  the  superscript  /  indicates  its  dependency  on  the  service  policy  /.  Thus,  the  total  residual 
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traffic  from  sources  in  class  52  requiring  service  in  frame  n  is  given  by, 

The  number  of  packets  from  source  i  that  are  dropped  under  policy  /  during  frame  n  are  given 


by, 

d{(n)  =  X i(n)-a{(n),  i  G  Si  =  {1,2, . . K},  (1) 

d{(n)  =  X ^(n) -«*{,(!*),  i  E.  S2  =  {K  +  1,  A'  +  2, . . . ,  N};  (2) 


a{(n)  denotes  the  number  of  packets  from  source  i  serviced  during  frame  n  under  policy  /;  a/j(n), 
i  E  S2.  denotes  the  amount  of  service  (in  slots)  provided  to  the  residual  traffic  associated  with 
source  i  during  frame  n.  Since  in  anv  frame  two  types  of  packet  from  a  source  in  52  may  be  present 
(new  arrivals  and  residual  traffic),  the  total  amount  of  resources  devoted  to  source  i  E  S2  in  frame 
n.  is  given  by, 

a{ (n)  =  a{A(n )  +  a{2(n),  i  G  52,  (3) 

where  a{2(n)  is  the  amount  of  available  resources  allocated  to  the  new  arrivals  from  S2  in  frame  n 
under  some  policy  /.  Since  the  residual  traffic  in  frame  n  consist  of  the  new  arrivals  from  sources 
in  S2  which  did  not  receive  service  in  frame  (n  —  1),  the  residual  traffic  in  frame  n  under  policy  / 
is  given  by. 

Xri'f(n)  =  Xi(n  -  1)  -  a{2(n  -  1)  i  G  52.  (4) 

Lot  dj  =  E  |</{(n)j.  a{  —  E  Ja{(u)j  and  A,  =  E'fAdn)]  be  the  (assumed  time  invariant)  expected 
values  of  the  associated  quantities.  The  residual  traffic  is  illustrated  in  the  realization  depicted  in 
Fig.  1. 


- - T - - 

n  n+1  n+2 


X  (n)  :  New  arrivals  from  class  S  j 

:  Residual  traffic  from  class  S  2 

2 

X  (n)  :  New  arrivals  from  class  5  2 
Figure  1:  Realization  of  residual  traffic. 
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If  a  Work  Conserving  Earliest  Due  Date  (WC-EDD)  policy  is  employed,  then  four  out  of  the 
five  packets  with  a  service  deadline  (or  due  date)  in  the  current  frame  will  be  served,  the  remaining 
one  will  be  dropped,  and  the  new  arrival  from  class  52  will  form  the  residual  traffic  in  the  next 
frame  (frame  ( n  +  1)).  According  to  the  WC-EDD  policy,  all  three  packets  with  service  deadline 
in  frame  (n  +  1)  will  be  served  during  the  frame,  as  well  as  one  of  the  two  packets  with  service 
deadline  in  frame  (n  +  2). 

From  the  above  example  it  is  evident  that  the  employed  EDD  policy  imposes  restrictions  on  the 
level  of  QoS  diversification  that  could  be  achieved  otherwise.  For  instance,  new  arrivals  from  class 
52  cannot  be  serviced  in  the  presence  of  packets  from  class  5i,  imposing  a  limit  on  the  minimum 
dropping  rate  for  sources  in  class  52.  This  limit  is  higher  than  the  dropping  rate  achieved  if,  for 
instance,  all  packets  (new  and  residual)  from  sources  in  class  52  had  service  priority  over  those  in 
Si. 

In  most  of  this  paper,  the  class  of  WC-EDD  polices  is  considered  for  the  following  reasons. 
First,  the  WC-EDD  policies  are  known  to  minimize  the  system  packet  dropping  (delay  violation) 
probability  [11],  resulting  in  throughput  maximization.  Unlike  a  more  general  case  in  which  the 
service  deadlines  or  due  dates  would  form  a  continuum  or  may  be  drawn  from  a  large  collection 
of  values,  only  two  service  deadlines  are  considered  in  the  TDMA  environment  in  this  work.  As 
a  consequence,  a  potentially  large  number  of  packets  from  different  sources  will  have  identical 
service  deadlines  or  due  dates  (one  of  two  values)  and,  thus,  significant  room  for  dropping  rate 
diversification  may  be  possible  without  departing  from  the  WC-EDD  policies.  In  addition,  it  is 
possible  to  determine  the  region  of  achievable  QoS  vectors  under  any  WC-EDD  policy,  as  well  as 
scheduling  policies  delivering  any  QoS  vector  in  this  region. 

If  the  QoS  vector  is  not  in  the  region  of  achievable  QoS  vectors  under  the  WC-EDD  policies, 
it  can  be  concluded  that  such  level  of  QoS  diversification  may  be  achieved  only  at  the  expense 
of  system  throughput[12].  This  may  suggest  that  the  sharing  of  the  resources  by  such  diverse 
applications  may  need  to  be  restricted  by  allowing  for  resource  sharing  by  less  diverse  applications. 
In  any  case,  by  deriving  an  upper  bound  of  the  region  of  achievable  QoS  vectors  under  any  WC 
policy,  it  can  be  determined  whether  a  given  QoS  vector  is  not  achievable. 

In  this  work  the  QoS  requirements  of  application  i  is  described  in  terms  of  a  maximum  tol¬ 
erable  delay  and  a  maximum  dropping  probability  this  is  the  probability  that  a  packet  from 
source  i  experiences  a  delay  greater  than  its  maximum  tolerable  delay  and,  thus,  is  dropped.  The 
corresponding  packet  dropping  rate  or  delay  violation  rate,  d{  (measured  in  expected  number  of 
dropped  packets  per  frame)  is  easily  determined  by,  di  =  0  <  i  <  N .  In  the  rest  of  this 

paper  the  QoS  vector  associated  with  the  supported  applications  will  be  described  in  terms  of  the 
dropping  rates,  with  the  understanding  that  these  rates  are  induced  due  to  violation  of  diverse 
delay  tolerances.  The  QoS  vector  associated  with  the  supported  applications  can  be  defined  in 
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terms  of  the  (performance)  packet  dropping  rate  vector  d, 

d  =  (^i  ?  c?2  +  1,...,  rfyv )  -  (5) 

The  first  question  addressed  in  the  sequel  (section  3)  is  whether  a  given  QoS  vectors  d  is 
achievable  under  any  WC-EDD  policy  /.  Necessary  and  sufficient  conditions  are  derived  in  order 
for  the  QoS  vector  to  be  achievable  under  these  policies,  leading  to  the  precise  determination  of  the 
region  of  achievable  QoS  vectors  d.  The  second  question,  addressed  in  section  5,  is  concerned  with 
the  design  of  scheduling  policies  that  deliver  any  achievable  QoS  vector  d.  Also,  an  upper  bound 
of  the  region  of  QoS  vectors  achieved  under  any  WC  policy  (not  necessarily  a  WC-EDD  policy)  is 
determined  in  section  6. 

3  Region  of  Achievable  QoS  Vectors  Under  WC-EDD  Policies 

3.1  Conservation  Law  and  Inequality  Constraints 

A  formal  definition  of  a  work-conserving  scheduling  policy  for  the  system  described  in  the  previous 
section  is  given  first. 

Definition  3.1  A  scheduling  policy  f.  is  work-conserving  iff  it  satisfies  the  following  conditions: 

I\  X 

XM(")+  E  A;J(r?  +  l)  =  0  if  ASl(n)+ A^/(n)  +  As2(n)  <  T  (6) 

t=l  t  =  /\+ 1 

X 

E ai(n)  =  T  if  As1(n)  + A5/(n)  + As2(n)  >  T  .  □  (7) 

1  =  1 


A  work-conserving  policy  does  not  waste  resources  (slots)  as  long  as  there  is  work  to  perform 
(packets  to  transmit).  Let  S  =  {5i  U  52}  be  the  set  of  all  sources  and  denote  the  average 

system  packet  dropping  rate  under  scheduling  policy  /,  given  by, 


d f  —  E 

fl{5}  “  tL 


X 


Li=l 


=E£K<»)]=i>f 


1=1 


i=l 


(8) 


Definition  3.2  Let  T  be  the  family  of  Work-Conserving  Earliest  Due  Date  (WC-EDD)  policies. 

□ 


Definition  3.3  Frame  n  is  said  to  be  under-loaded  when  Asj(n)  +  A^(n)  <  T  and  overloaded  when 
As,(h)  +  A Tsf2(n)>T.  □ 


Notice  that  during  an  overloaded  frame,  packets  will  be  dropped  and  none  of  the  As2(n)  packets 
will  receive  service  under  any  policy  /  G  T .  Moreover,  for  any  /  G  T  the  following  hold: 


=  0  ifAS](n)  +  A5/(n)<r 
>0  if  ASl(n)  +  A5/(n)  >T 


(9) 
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and, 


A[’/(n  +  l)  = 


Xi{n)  +  X^in)-  a{(n)  if  ASl(n)  +  A^(»)  <  T  £  ^ 
Xi(n)  if  A Sl(»)  +  A^(n)  >  T 


(10) 


Theorem  3.1  The  system  dropping  rate ,  25  conserved  under  any  f  £  J~  and  is  a  lower  bound 

on  the  system  dropping  rate  induced  under  any  policy.  More  specifically , 


d[s}  i  {e  [a5»  +  A £'(»)  |  ASl(n)  +  A gf(n)  >  t\  -  t)  P  (a5»  +  A^ (n)  >  t) 

V/GJT.  □ 


(11) 


A  ; 

=  b{S } 


Proof.  First,  the  first  equality  in  (11)  is  proved.  Summing  (1)  and  (2)  over  all  i  £  5,  the  following 
is  obtained: 


5M(«)  =  E  **•(")+  51  AiJ(n)-  5^cf(n)-  51  (i2) 

i€S  z£S]  ^6*^2 

Since  /  is  a  WC-EDD  policy  it  satisfies  (6)  and  (7)  of  Definition  3.1  and  (9)  and  (10)  of  Definition 
3.2.  Thus,  the  above  expression  becomes. 


5 ld{(n)  = 


0  XSl(n)  +  Xr/2(n)<T 

X s,(n)  +  A 5/(n)  -  T  X 5,(n)  +  A^(«)  >  T. 


(13) 


By  applying  the  expectation  operator  in  (13).  the  first  equality  is  obtained.  To  prove  the  second 
equality  in  (11 )  -  and  thus,  show  that  d$  is  conserved  -  it  suffices  to  show  that  Xr^(n)  is  independent 
of  the  policy  /  G  T .  For  any  /  G  T , 


f  max{0,AS2(n)  +  ASl(n)  +  A5/(n)-T}  if  AS](n)  +  A^(n)  <  T 
\  Xs2(n)  if  XSi(n)  +  A^(n)  >  T. 


(14) 


Therefore,  A^(7?)  depends  on  A -  1)  which  may  in-turn  be  dependent  on  /  and  is  otherwise 
independent  from  /.  By  induction  A^(n)  is  only  dependent  on  the  initial  conditions  A£2(0)  which  is 
independent  of  the  policy  and  equal  to  zero.  Thus,  the  total  residual  traffic  in  frame  n  is  independent 
of  the  policy  /  E  T  and  therefore  can  be  denoted  as  A^2(n)  .  That  is, 


Ar5/(n)  =  £  X[f(n)  =  XrS2(n),  Vn,  V/  G  T,  (15) 

ies2 

and  therefore  proving  the  second  equality.  Finally,  it  is  a  well  known  result  that  d is  the  minimum 
dropping  rate  since  it  is  induced  bv  a  WC-EDD  policy [1 1] .  □ 

Since  the  total  residual  traffic  in  frame  n,  A£2(n),  is  independent  of  the  policy  /  E  T ,  the 
following  corollary  is  self-evident  in  view  of  the  previous  theorem. 
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Corollary  3.1  The  number  of  service  opportunities  for  the  new  arrivals  from  S 2  in  frame  n  is 
independent  of  the  selected  f  £  T .  Therefore ,  the  residual  traffic  for  any  source  i  in  frame  ( n+  1). 
A[’^(n+ 1);  is  only  dependent  on  the  policy  f  chosen  in  the  present  frame,  n,  and  not  in  past  frames. 

Corollary  3.1  is  employed  in  the  proof  of  Theorem  3.5  and  in  the  development  of  a  class  of 
policies  in  section  5. 

Let  A^(n)  =  Ylieg  ^i(n)  aggregate  arrivals  from  sources  in  subset  g  in  frame  n.  Let  d * 

denote  the  aggregate  packet  dropping  rate  associated  with  sources  in  group  g  only,  under  policy  /; 
all  N  sources  in  5  are  assumed  to  be  present  and  served  under  the  policy  /.  d f  is  defined  by, 


dU  = £ 


=  J2E[di(n)\  S^S.  (16) 

i£g  i£g 

The  following  lemma  will  be  used  in  the  proof  of  the  theorem  that  follows. 

Lemma  3.1  Let  g  C  52  and  let  A Tg^(n)  —  Ylieg  K^(n)  & e  ^ e  aggregate  residual  traffic  form  subset 
g  under  some  policy  f.  Then , 

A T3’f(n )  >  \Tg(n),  Vr?  and  V/  £  P,  (17) 

where  A rg(n)  is  given  by. 


Ar(ri  +  1)=<  m&xi°'X9(n)  + xsJ{n)  + XTS2(n)-T}  if  A5,(n)  +  A^fn)  <  T 
\  Aa(n)  if  As,(n)  +  A^(n)  >  T. 


(18) 


A^(r?)  is  called  the  minimum  residual  traffic  process  for  sources  in  g  £  S2,  for  a  system  that  is 
served  under  a  IVC-EDD  policy.  □ 

Proof.  The  above  inequality  is  self-evident  since  A£(n)  corresponds  to  the  residual  traffic  under  a 
policy  in  which  new  arrivals  from  sources  in  g  are  given  service  priority  new  arrivals  in  {52  —  g }. 
□  . 

Theorem  3.2  Let  b{g}  denote  the  lower  bound  for  the  aggregate  packet  dropping  rate  for  sources 
in  set  g.  g  C  5.  under  any  policy  f  £  T .  Then  this  bound  is  given  by, 


b{g}  =  {E  [AJ(n)  |  A rg(n)  >  t]  -  t}  P  ( AJ(»)  >  t)  ,  g  C  S- 


(19) 


b{g}  =  [E  [A,(n)  |  A3(n)>T]  -  T}  P (Xg(n)  >  T) ,  g  C  5,. 


(20) 


b{g}  =  {E  [Ax(n)  +  A'(n)  |  Ax(n)  +  AJ(n)  >t]~t}p  (\x(n)  +  A J(n)  >  t)  , 

9  =C  5, 

x  =  {g  n  Si}  ±  0  and  y  =  {g  D  S2 }  #  0 


(21) 
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Proof.  If  g  C  52,  the  bound  b{gy  is  achieved  when  A£(n)  packets  do  not  see  any  interference 
from  packets  in  S\  and  {52  -  5},  or  equivalently,  all  T  resources  are  available  to  service  the  A£(rc) 
packets.  Therefore,  by  employing  A Tg(n)  in  (11)  the  proof  follows.  The  probability  distribution  of 
the  minimum  residual  traffic,  A£(n),  is  calculated  from  (18)  as  in  section  4.  If  g  C  Si,  the  proof  can 
be  derived  as  in  [10].  If  g  C  5,  where  x  =  {yfl5i}  and  y  =  {<7(152}  are  non-empty,  b{gy  is  achieved 
when  the  traffic  Ax(n)  +  A£(ra)  does  not  see  interference  from  traffic  from  sources  in  {5  —  g }.  By 
generalizing  (19),  the  proof  follows.  □ 

The  following  class  of  policies  is  needed  for  the  establishment  of  the  region  of  achievable  QoS 
vectors  under  the  WC-EDD  policies,  as  well  as  for  the  development  of  policies  which  deliver  any 
achievable  QoS  vector  for  this  region. 


Definition  3.4  A  Deadline-Sensitive  Ordered  Head  of  Line  (DSO-HoL)  priority  service  policy  is 
defined  to  be  the  policy  which  first  separates  packets  into  two  sets:  packets  having  a  service  deadline 
in  the  current  frame  and  packets  having  a  service  deadline  in  the  next  frame.  Packets  from  sources 
having  a  service  deadline  in  the  current  frame  are  serviced  according  to  a  priority  service  policy, 

7T=  ( 7T  ] ,  7T  2 _ ,  7T  Ar )  *  7r i  E  {1,2  ,...,Ar},  7T{  /  7 Tj,  1  <  i,  j  <  N.  The  index  of  7T;  indicates  the 

order  of  the  priority  given  to  the  i T{  source  to  service  packets  having  a  service  deadline  in  the  current 
frame.  None  of  the  tTj  sources,  j  >  i ,  may  be  served  as  long  as  packets  with  current  service  deadline 
from  sources  tt^  .  k  <  i,  are  present.  After  servicing  the  packets  having  a  current  deadline,  the  same 
service  policy .  rc=  (fil,  7T2, . . . ,  7 ryv),  is  followed  for  packets  from  sources  that  are  present  and  do  not 
have  a  current  deadline.  □ 


Theorem  3.3  The  following  constraints  associated  with  the  induced  packet  dropping  rate  vector 
d/  =  (d{  .d{ . dj^r)  are  satisfied  by  any  scheduling  policy  f  E  T , 

d[9)>b{gy  V<7  C  5.  V/  G  T  (22) 

where  d^g j  and  b^  are  given  in  (16)  and  (19)-(21).  □ 


Proof.  Let  g  C  S  and  x  —  g  R  S\  and  y  =  g  fl  52.  Suppose  that  the  packets  from  sources  in  g  are 
served  under  a  scheduling  policy  fQ  €  T  according  to  which  they  are  given  Deadline  Sensitive  Head 
of  the  Line  (DS-HoL)  priority  over  the  packets  from  sources  in  {S  —  g}.  That  is,  during  overload 
conditions  no  packet  from  sources  in  {5  —  <7}  is  served  unless  no  packets  with  a  current  service 
deadline  from  sources  in  g  are  present.  During  under-load  conditions,  no  packets  from  sources  in 
{52  —  y}  are  serviced  while  packets  from  sources  in  y  =  {g  D  52}  are  present,  and  the  minimum 
residual  traffic  is  achieved  for  set  y .  Thus, 


Y,d<°(n)  = 


0  if  A x(n)  +  A ry(n)  <  T 

Ax(n)  +  A;(n)-T  \{  \x(n)  +  \Ty(n)  >  T . 


(23) 
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From  (21)  and  (23)  it  is  clear  that, 

d{h  ~ 

for  all  policies  f0  G  T  that  provide  DS-HoL  priority  to  the  packets  from  sources  in  g.  Since  no 
other  policy  /  6  T  can  provide  better  service  (that  is,  lower  aggregate  packet  dropping  rate)  to 
sources  in  g  than  /0,  it  is  evident  that, 

d{g}>big},  VfET,  gCS.  □  (25) 

3.2  Region  of  Achievable  QoS  Vectors 

The  main  result  of  this  section  is  the  determination  of  the  region  V  of  the  achievable  QoS  vectors 
d  under  WC-EDD  policies.  The  following  corollary  provides  a  set  of  necessary  conditions  in  order 
for  a  QoS  vector  d  to  be  achievable,  followed  by  a  corollary  regarding  an  upper  bound  on  the 
achievable  region  for  QoS  vectors  d  under  WC-EDD  policies.  Their  proofs  are  self-evident  in  view 
of  Theorems  3.1  and  3.3. 

Corollary  3.2  A  necessary  condition  in  order  for  a  QoS  vector  d  =  (dj,  c^, . . . ,  dyv)  to  be  achieved 
by  a  policy  f  £  T  is  that  its  components  satisfy  the  following  constraints, 

d{g}  >  b{g}  VgCS  (26) 

d{s}  —  b{s}-  1=1  (2~) 

Corollary  3.3  Let  Vu  denote  the  collection  of  all  vectors  d  satisfying  (26)  and  (27).  Then  Vu  is 
an  upper  bound  on  the  region  D  of  achievable  QoS  vectors  d.  That  is, 

V  C  Vu.  □  (28) 


The  following  theorem  describes  the  vectors  contained  in  Vu. 

Theorem  3.4  Any  vector  in  the  set  Vu  can  be  expressed  as  a  convex  combination  of  extreme 
points  (vertices)  of  Vu;  that  is,  Vu  may  be  expressed  as  the  convex  hull  of  its  extreme  points, 
Vu  =conv[ext(Vu )].  □ 

Proof.  The  proof  follows  from  the  fact  that  Vu  is  a  bounded  set  defined  by  a  finite  intersection  of 
closed  half  spaces,  (see  (26)  and  (27)).  Then  by  definition,  Vu  is  a  polytope  [13]  and  Theorem  3.4 
follows  directly  from  properties  of  polytopes  [13].  □ 

The  following  theorem  establishes  a  relationship  between  scheduling  policies  and  the  vertices 
of  Vu. 
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Theorem  3.5  d*  is  a  vertex  (extreme  point)  of  the  setT>u  iff  d*  is  a  dropping  rate  vector  resulting 
from  a  Deadline-Sensitive  Ordered  HoL  (DSO-HoL)  priority  service  policy,  7r=  (tti,  7r2,  •  •  d 


Proof.  Assume  that  d*  is  a  vertex  of  Vu.  Then  d*  must  lie  at  the  intersection  of  N  hyper-planes 
and  its  coordinates  must  satisfy  N  simultaneous,  linearly  independent  equations  (definition  of  a 
polytope  vertex,  see  [13]),  given  by, 


~  j  —  1,2,..., A, 


(29) 


where  one  of  the  gff  s  is  the  set  5  =  {1,2,  and  the  remaining  (N  -  1)  are  proper,  non-empty 

and  different  subsets  of  S.  Lemma  10.3  (see  appendix)  establishes  that  the  subsets  gfs  are  strictly 
included  in  each  other.  Therefore,  by  adopting  the  order  g\  C  <?2  C  •  •  •  C  ffjv-i  C  gu  =  S.  the  gfs 
are  given  by, 


9\  -  {^1}  (3°) 

9\  C  52  =  {tti  ,  7T2> 

52  c  92,  =  {^l  1  7r25  ^3} 


5A  -1  c  gK  =  {^n ,  tt2,  - .  - ,  tTjv}  =  {1.2 - ,N}, 

where  7 r;  €  {1.2, _ A'},  7 r,-  ^  Try,  for  i  ■£  j,  1  <  i,  j  <  AM  By  using  (21)  the  following  can  be 

obtained  from  (29). 

-  b{*i) 

^{r2}  =  ~  b{”l} 

d{x3}  =  b{ T 1  ,'*2 *^3  }  ~  b{*U*2} 

;  (31) 

=  ^{^1,^2.—.^}  ~  b{*U*2,—*N-l  }’ 

which  is  precisely  the  dropping  rate  vector  induced  by  the  DSO-HoL  priority  policy  7r=  (7Ti,7r2,. .  .,7 ryv) 
Thus,  for  any  vertex  (extreme  point)  there  exists  a  DSO-HoL  priority  policy  that  induces  it.  Since 
it  is  easy  to  see  that  the  dropping  rate  vector  resulting  from  a  DSO-HoL  priority  policy  must  satisfy 
(31),  it  is  evident  that  these  dropping  rate  vectors  will  be  the  vertices  of  Vu.  □ 

Fig.  2  and  Fig.  3  provide  a  graphical  illustration  of  the  region  Vu  for  the  case  of  N  =  2 
and  Ar  =  3  sources,  respectively.  The  extreme  points  dei(_i’s  correspond  to  QoS  vectors  induced 
by  the  (AM)  DSO-HoL  priority  policies  x!=  ( tti ,  7r2 , . . . ,  7 r/v),  1  <  i  <  AM,  as  shown  in  Theorem 
3.5.  Referring  to  Fig.  2,  it  may  be  observed  that  the  policy  (7Ti,7r2)  =  (1,2)  corresponds  to  the 
intersection  of  the  line  for  the  lower  bound  on  the  packet  dropping  rate  line  for  source  1,  6^},  with 
the  system  dropping  rate  line  f>{i,2}-  Similarly,  the  second  extreme  point  induced  by  the  policy 
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Figure  2:  The  region  (polytope)  Vu  for  a  system  with  two  sources. 

(m.7T2)  =  (2,1)  is  the  intersection  of  the  lower  bound  on  the  packet  dropping  rate  for  source  2, 
b{2}  and  Similar  observations  can  be  made  for  the  region  Vu  for  a  system  of  N  =  3  sources 

shown  in  Fig.  3. 

Let  V.  V  C  Vu ,  be  the  achievable  region  of  QoS  vectors  d.  The  following  theorem  establishes 
its  convexity. 

Theorem  3.6  Let  di.d2  E  V ,  then  d3,  where 

d3  =  ad]  +  (1  -  a)d2,  a  >  0,  a  <  1,  (32) 

is  also  in  V .  That  is  V  is  convex .  □ 

Proof.  Let  d]  (d2)  be  the  dropping  rate  vector  induced  by  the  policy  /]  (/2)  E  T\  that  is, 
d].d2  E  V .  Consider  a  scheduling  policy  /3  that  at  each  under-loaded  frame  decides  to  follow 
the  scheduling  rule  of  policy  f\  with  probability  a  and  policy  /2  with  probability  (1  —  a);  the 
decisions  over  consecutive  under-loaded  frames  are  independent.  Notice  that  no  dropping  occurs 
during  under-loaded  frames,  and  thus,  any  policy  /  E  T  could  be  selected.  However,  the  residual 
traffic  in  the  first  overloaded  frame  depends  on  the  policy  selected  in  the  previous  frame  (only), 
the  policy  selection  in  under-loaded  frames  does  affect  the  performance  (drops)  during  overloaded 
frames1.  Decisions  over  the  first  overloaded  frame  following  an  under-loaded  one,  follow  the  policy 

lIn  fact  any  policy  /  E  T  can  be  selected  during  an  under-loaded  frame,  except  for  the  frame  before  an  overloaded 
one,  since  shown  in  corollary  3.1,  the  residual  traffic  for  source  i  in  frame  n,  A^1(2)(n)  is  only  dependent  on  the 
policy  2)  selected  in  frame  (n  —  1)  and  not  in  earlier  frames. 
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Figure  3:  The  region  (polytope)  Vu  for  a  system  with  three  sources. 

selected  for  the  previous  frame,  ensuring  that  the  Ap^1(2)(n)  packets  are  serviced  under  policy  /i(2)2* 
Over  overloaded  frames  other  that  the  first  of  an  overloaded  period,  policy  /i(/2)  is  selected  with 
probability  a  (1  —  a)  independently  from  the  selection  in  the  previous  frame.  Note  that  the  selected 
policy  during  an  overloaded  frame  does  not  affect  the  residual  traffic  in  the  next  frame,  refer  to 
( 18).  It  should  also  be  noted  that  the  aggregate  traffic,  (ft)  +  A^n),  is  independent  of  the  policy 
/  £  T  (Theorem  3.1).  This  implies  that  the  occurrence  of  overloaded  periods  is  independent  of  the 
policy  and  the  length  of  consecutive  overload  periods  are  independent  and  identically  distribute 
random  variables,  regardless  of  the  selected  policy.  In  view  of  the  above,  it  is  easy  to  establish 
that'the  dropping  rate  performance  of  policy  /^j  is  induced  with  probability  a  (1  —  a).  Thus,  the 
packet  dropping  rate  induced  by  policy  53  is  given  by, 

d3  =  od]  +  (1  -  o)d2-  (33) 

Since  d3  is  achieved  by  a  policy  in  T ,  d3  E  V ,  establishing  the  convexity  of  V.  □ 

The  next  theorem  establishes  the  region  V  of  achievable  QoS  vectors. 

Theorem  3.7  V  =  Vu .  □ 

Proof.  Since  V  C  Vu  (Corollary  3.3),  it  suffices  to  establish  that  Vu  C  V  to  complete  the  proof. 
Notice  that  Vu =conv[ext(Vu)]  (Theorem  3.4)  and  that  V  is  convex  (Theorem  3.6)  and,  thus,  if 
{ext{Vu)}  C  V  then  Vu  C  V  .  The  latter  holds  true  since  the  extreme  points  of  Vu  are  induced 

2 Otherwise,  packets  dropped  in  the  overloaded  frame  under  policy  f\  would  be  associated  with  the  residual  traffic 
generated  in  the  previous  (under-loaded)  frame  under  policy  $2\  the  resulting  performance  would  be  that  of  neither 
/i  nor  /2. 
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by  the  DSO-HoL  priority  policies  (Theorem  3.5)  and,  thus,  these  points  are  in  the  region  V  of 
achievable  QoS  vectors.  □ 


4  Analysis  of  Residual  Traffic 


In  this  section  the  probability  distribution  of  the  minimum  residual  traffic  process,  A rg(n),  is  de¬ 
rived  when  the  arrival  processes  A,(n),  i  £  5,  are  independent  and  each  modeled  by  a  sequence 
of  independent  identically  distributed  random  variables.  As  seen  in  (14),  future  evolution  of  the 
total  residual  traffic  process,  A s2(n  +  1),  depends  only  on  the  present  values  of  A s2{n),  A Sj(n)  and 
As2(n).  Therefore,  under  the  i.i.d.  assumption  for  the  arrival  processes,  A s2(n)  is  Markovian  with 
transition  probabilities, 


P(\rs2(n  +  1)=  i  A rS2(n)  =  j)  = 


P(\s2(n)  +  mm(j  +  \Sl(n),T)-T  =  i)  i  >  0, 
P(\s2(n)  +  min(;  +  A Sl(n),T)  <T)  i  =  0. 


(34) 


Assuming  the  stationary  distribution  of  A rs  (n)  exists,  the  distribution  vector,  A£2,  is  easily  com¬ 
puted  as. 


Ars  P  = 
->2 


'*^2  ' 


(35) 


where  P  is  the  probability  transition  matrix  with  elements  pjj  given  in  (34). 

The  distribution  of  the  minimum  residual  traffic  process  for  subset  g  in  (18)  is  found  by 
conditioning  on  A^2(n).  As  shown  in  Theorem  3.1,  no  matter  what  the  subset  g  under  consideration, 
the  quantity  A^(t?)  is  conserved.  The  stationary  distribution  for  A£(n)  is  found  as, 

P( A»  =  i)  =  EP(A^n)  =  *|*s2("  -  1)  =  -  1)  =  j)-  (36) 

J 

From  the  i.i.d.  assumption, 


P(\l(n  +  l)  =  i\rs2(n)  =  j)  = 


P(Xg{n)  +  min  (j  +  XSj  (n),  T)  -  T  =  i)  i  >  0, 
P(Xg(n)  +  min(j  +  A5l(n),T)  <T)  i  =  0. 


(37) 


5  A  Class  of  Policies 

Let  C dso-HoL  denote  the  class  of  Delay-Sensitive  Ordered  Head  of  Line  (DSO-HoL)  service  policies 
7 t1  introduced  in  section  3.1. 

Definition  5.1  In  each  under-loaded  frame ,  a  Mixing  DSO-HoL  Policy  fm  decides  to  follow  the 
DSO-HoL  policy  rrl  with  probability  a;.  >  0,  1  <  i  <  AT!,  ai  =  1/  decisions  over  consecutive 

under-loaded  or  overloaded  frames  are  independent.  The  DSO-HoL  policy  from  the  previous  under¬ 
loaded  frame  is  chosen  in  the  first  overloaded  frame.  Clearly,  fm  is  completely  determined  by  the 
N\  dimensional  vector  ot,  <x  >  0,  1*  ol  =  1.  Let  M DSO-HoL  denote  the  class  of  such  policies.  □ 
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The  proof  of  the  following  theorem  follows  directly  from  the  proof  of  Theorem  3.6. 

Theorem  5.1  The  dropping  rate  vector  induced  by  a  Mixing  DSO-HoL  policy  is  given  by , 

N\ 

d/m  =  (38) 

i=i 

The  following  theorem  establishes  the  main  results  of  this  section. 

Theorem  5.2  For  each  packet  dropping  rate  vector  d  £  V  there  exist  a  policy  fm  E  Mdso-HoL 
that  induces  d.  □ 

Proof.  Let  d  E  V.  then  d  =  ai^ext~i  for  some  a=  (0^,02, ..  .,ayv»)  where  a,*  >  0,  1  < 
i  <  NL  ai  ~  1?  since  any  point  in  V  can  be  written  as  a  convex  combination  of  the  extreme 

points  (vertices)  dej:t-i  of  V{~  Vu );  each  d ext~i  is  induced  by  some  policy  in  Cdso-HoL  (Theorem 
3.5). 

Let  fm  be  the  mixing  policy  which  selects  the  HoL  priority  7T1  (that  induces  deXf_;)  with 

probability  a,*.  The  packet  dropping  rate  vector  d induced  by  fm  is  given  by, 

N\ 

d/m  =  5ZQ<dext-i  (39) 

i=l 

and  thus  fm  induces  d.  □ 

The  following  corollary  is  obvious  in  view  of  the  above  theorem. 


Corollary  5.1  Let  d  E  V  be  a  target  packet  dropping  rate  vector .  The  mixing  policy  fm=a. 

induces  d.  where  a  is  such  that . 

TV! 

£‘a[d,x(]i^a,dcx<_1-  =  d.  (40) 

i=l 

a  >  0.  (41) 

1  •  a  =  1,  (42) 


where  Ecx  [•]  weighted  average  of  the  set  of  extreme  points.  deX(,  ofV  with  respect  to  the  Probability 
Mass  Function  a.  □ 


6  Upper  Bound  on  the  Region  of  Achievable  QoS  Vectors  Under 
Any  Service  Policy 


In  the  previous  section  the  region  of  achievable  QoS  vectors  induced  by  WC-EDD  policies  (denoted 
in  this  section  by  J)EDD)  was  determined  and  described  in  terms  of  conditions  (26)  and  (27), 
restated  (using  superscript  EDD)  as, 


d{9 }  ^ 


lEDD 

b{9 } 


V<7CS, 


(43) 


b{F}D  an^  bfg}D  are  l°wer  bounds  on  the  performance  of  WC-EDD  policies  as  given  in  (11)  and 
(21),  respectively.  It  is  well  known  that  the  WC-EDD  polices  optimize  the  system  (5)  performance 
by  minimizing  the  system  dropping  rate.  Therefore,  any  policy  that  attempts  to  improve  (decrease) 
the  dropping  rate  for  a  subset  of  sources  g  beyond  the  lower  bound  shown  in  (18)-(21)  by  relaxing 
the  EDD  condition  will  result  in  increased  system  ( S )  system  dropping  rate.  That  is.  the  lower 
bounds  on  the  dropping  rates  achieved  by  the  WC-EDD  policies  and  the  class  of  policies  which  do 
not  necessarily  satisfy  the  EDD  condition  (denoted  here  as  an  Unconstrained  (UC)  policy)  satisfy 
the  following  conditions. 


hEDD  > 
bi9 )  ^ 


C  5, 


> 


b 


EDD 
{5}  * 


(44) 


b1^  is  the  unconstrained  lower  bound  on  the  dropping  rate  for  the  sources  in  set  g.  This  bound 
is  achieved  by  considering  that  packets  from  sources  in  set  g  only  are  present  and  serviced  under 
a  work-conserving  earliest  due  date  policy;  sources  in  {5  —  g}  are  considered  to  be  absent,  bE^  is 
calculated  by  applying  ( 1 1)  and  ( 14)  to  the  set  g  only.  That  is,  replacing  S\  and  52  with  x  —  {pfl5i} 
and  y  =  {#14  52}  in  (11)  and  (14),  respectively. 

Since  no  policy  can  do  better  for  sources  in  set  g  than  6^  when  all  sources  in  S  are  present, 
the  following  necessary  conditions  must  be  satisfied  by  any  QoS  vector  d  which  is  achieved  under 
some  policy. 


d{g) 

<l{s) 


> 

> 


bt sT- 


V<7  C  5, 


(45) 


The  following  proposition  is  self-evident  in  view  of  the  above  discussion. 


Proposition  6.1  An  upper  bound  on  the  region  of  QoS  vectors  achieved  under  any  policy,  Vuc-U , 
is  give  n  by  (J^o).  □ 

Fig.  4  depicts  the  region  J)EDD  and  VUC'U  for  the  case  of  two  sources. 


7  Region  of  Acceptable  QoS  Vectors 

Definition  7.1  The  region  of  acceptable  QoS  vectors,  associated  with  the  region  of  achievable  QoS 
vectors  V .  A{V).  is  defined  to  be  the  region  of  vectors  d  satisfying , 

d{g }  ^  b{g}  ^9  C  S,  (46) 

d{S}  >  b{S}-  D 

Proposition  7.1  If  d  £  A{T>)  then  there  exists  a  vector  d'  E  V  which  is  such  that  d[  <  d{  Vi  6  5. 

□ 
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Figure  4:  Necessary  performance  bounds  under  any  policy  and  the  sufficient  bounds  for  WC-EDD 
family  of  policies. 

The  proof  of  the  above  proposition  may  be  found  in  [10].  Proposition  7.1  implies  that  if  the  required 
QoS  vector  d  is  in  A{ P),  there  exist  a  policy  which  can  deliver  d  or  better  (that  is,  less  than  the 
dropping  rate  required  by  any  source).  Fig.  5  depicts  the  acceptable  region  A(VEDD)  and  the 
upper  bound  on  the  region  of  achievable  QoS  vectors  Pl  c,u. 

A(j)EDD )  contains  all  the  QoS  vectors  which  can  either  be  delivered  exactly,  or  a  better  QoS 
vector  can  be  delivered  under  some  WC-EDD  policy.  VUC'U  is  an  upper  bound  on  the  QoS  vectors 
which  can  be  delivered  under  any  policy. 

8  Numerical  Example  and  Verification  through  Simulation 

In  this  section  two  examples  are  presented  for  a  system  with  two  and  three  sources,  respectively, 
competing  for  T  slots  in  a  TDMA  frame.  The  source  packet  arrival  processes  are  assumed  to  be 
mutually  independent.  Each  process  is  described  in  terms  of  a  sequences  of  independent  and  identi¬ 
cally  distributed  random  variables  embedded  at  the  frame  boundaries.  Let  Ek  =  {0, 1, . . .,  Mk  —  1} 
denote  the  state  space  of  the  arrival  process  associated  with  source  k. 

Consider  a  system  with  5  ( T  =  5)  slots  per  frame  and  2  sources,  k  =  2.  In  this  example  A/1  =  7 
and  A/2  =  7  and  when  in  state  i.  a  source  generates  i  packets  in  the  current  frame.  Source  1  has  a 
maximum  arrival  rate  of  6  packets  per  frame  with  an  average  per  frame  arrival  rate  of  3.6.  Source 
2  has  an  average  arrival  rate  of  3.2  packets  per  frame  with  a  maximum  arrival  rate  of  6  packets  per 
frame.  Sources  1  and  2  have  state  probability  distributions  of  (0.0,  0.1,  0.1,  0.3,  0.2,  0.2,  0.1)  and 
(0.1.  0.0,  0.4,  0.0.  0.3,  0.0,  0.2),  respectively.  Sources  1  has  a  maximum  delay  tolerance  of  1  frame 
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Necessary 

Bounds 


Figure  5:  Acceptable  QoS  region  under  WC-EDD  policies  and  its  upper  bound  under  any  policy. 


bfDD 

b$DD 

hEDD 

V2) 

0.100 

0.160 

1.856 

Table  1:  Lower  bounds  on  dropping  rates  under  WC-EDD  policies. 

while  sources  2  has  a  maximum  delay  tolerance  of  2  frames. 

The  region  of  achievable  QoS  vectors  under  WC-EDD  policies  for  this  system  is  calculated 
from  (11).  (14).  (18)  and  (21);  the  bounds  on  the  dropping  rates  are  given  in  Table  1.  The 
unconstrained  lower  bounds  were  evaluated  by  applying  (11)  and  (14)  to  each  source  k  only,  and 
the  results  are  presented  in  Table  2.  Notice  that  the  unconstrained  lower  bound  for  source  1,  b^c, 
cannot  be  decreased  beyond  the  WC-EDD  lower  bound,  bfDD,  since  every  packet  from  source  1 
has  a  maximum  delay  tolerance  of  1  frame.  The  WC-EDD  bound  for  source  2  can  be  decreased  by 
relaxing  the  EDD  condition  on  the  system;  that  is,  if  the  policy  allows  service  to  new  arrivals  from 
source  2  before  packets  from  source  1,  a  lower  dropping  rate  will  be  delivered  to  source  2.  The 
decrease  in  the  dropping  rate  beyond  the  WC-EDD  bound  for  source  2  is  achieved  at  the  expense 
of  increasing  the  system  dropping  rate.  This  result  is  verified  through  simulations  and  displayed  in 
Fig.  C. 

In  the  simulation  results,  policies3  were  generated  that  ranged  from  a  policy  that  gives  all 
packets  from  source  2  service  priority  over  packets  from  source  1,  to  the  DSO-HoL  service  policy 
7t=(2,1)  in  which  only  the  packets  with  a  current  service  deadline  from  source  2  were  given  priority 
3The  performance  of  the  DSO-HoL  policy  n  =  (1,2)  was  also  verified  through  simulation  and  is  displayed  in  Fig. 

6. 
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b"c 

b\c 

0.100 

0.001 

Table  2:  Lower  bounds  on  dropping  rates  under  any  policies. 


Numerical  and  Simulation  Results 


Dropping  Rate,  Source  1  (packets/frame) 


Dropping  Rate,  Source  1  (packets/frame) 


Figure  6:  Evaluation  of  a  numerical  example  with  supporting  simulation  results. 
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State 

Source  1 

Source  2 

Source  3 

0 

0.200 

0.600 

0.360 

1 

0 

0 

0 

2 

0.405 

0.400 

0.480 

3 

0 

0 

0 

4 

0,295 

0 

0.160 

5 

0 

0 

0 

6 

0.200 

0 

0 

Table  3:  State  probability  distribution  for  sources  1,2  and  3  of  example  2. 

over  packet  from  source  1.  Under  the  former  policy,  the  UC  lower  bound  (62  =  0.001)  is  achieved 
for  source  2,  but  the  system  dropping  rate  is  increased  from  its  minimum  of  1.859  to  2.025  packets 
per  frame.  The  three  policies  which  induce  the  other  points  shown  in  Fig.  6  were  obtained  bv 
varying  the  frequency  at  which  source  2  was  allowed  to  violated  the  EDD  condition.  As  is  clearly 
observed  in  Fig.  6.  for  each  of  the  four  policies  that  violate  the  EDD  condition,  the  resulting  system 
dropping  rate  is  increased  compared  to  the  conserved  system  dropping  rate  induced  by  WC-EDD 
policies. 

As  it  was  shown  in  section  3.2,  satisfying  the  condition  on  the  system  performance,  given  by 
(27),  is  only  necessary  and  not  sufficient  to  guarantee  that  the  target  QoS  vector  is  achievable4.  To 
illustrate  this  concept,  consider  the  system  given  in  the  previous  example  and  let  d  =  (0.099, 1.760) 
be  the  target  QoS  vector,  d  satisfies  the  condition  on  the  system  dropping  rate,  that  is  d{$}  — 
b^D  =  1.859.  Although,  d\  <  b\rc  =  0.100,  and  thus  the  target  QoS  vector  cannot  be  achieved 
under  any  policy.  For  this  system,  the  best  possible  performance  source  1  can  receive  is  if  it  is 
given  absolute  service  priority,  resulting  in  dropping  rates  of  0.100  and  1.759  for  sources  1  and 
2,  respectively.  The  overall  system  performance  is  satisfied,  but  source  1  is  experiencing  poorer 
service  than  what  is  desired,  while  sources  2  is  experiencing  improved  performance. 

To  illustrate  the  design  of  WC-EDD  scheduling  polices,  consider  the  following  example  con¬ 
sisting  of  three  VBR  sources  competing  for  7  (T  =  7)  slots  of  a  TDMA  frame.  In  this  example 
A/1  =  3.  M 2  =  5  and  M 3  =  7  and  when  in  state  i,  a  source  generates  i  packets  in  the  current 
frame.  Each  source  is  modeled  by  a  sequence  of  i.i.d.  random  variables  with  a  maximum  arrival 
rate  of  2.  4  and  6  packets  per  frame,  respectively.  The  probability  distributions  for  each  source  are 
given  in  Table  3. 

Packets  from  source  1  have  a  maximum  delay  tolerance  of  1  frame  while  packets  from  sources 

4 In  the  speciaJ  case  of  a  homogeneous  system,  such  as  a  cellular  voice  system,  satisfying  (27)  is  sufficient  to 
guarantee  that  the  target  QoS  vector  is  achievable.  This  result  has  been  established  in  [10]. 
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Figure  7:  Time-averaged  dropping  rate  for  each  source  under  the  policy  fm. 

2  and  3  can  tolerate  delays  of  up  to  2  frames.  The  maximum  dropping  probabilities  (resulting  from 
delay  violations)  acceptable  for  sources  1,  2  and  3  are:  pi  =  0.02,  P2  =  0.01  and  p$  =  0.02.  The 
QoS  vector  in  this  case  is:  d=  (0.064,0.008.0.032). 

By  using  (46).  it  can  be  determined  that  d  is  in  the  region  of  acceptable  QoS  vectors  for  WC- 
EDD  polices.  Furthermore,  it  can  be  shown  that  d  E  VEDD ,  and  thus  (as  shown  in  section  5)  there 
exists  a  mixing  DSO-HoL  policy  /m  =  a.  delivering  it.  Any  a  satisfying  (40)-(42)  of  Corollary  5.1 
may  be  chosen.  For  this  example,  the  following  ot0  was  chosen  by  employing  linear  programming 
techniques. 

o\  =0 
o2  =  0 
03  =  0.3367 

04  =  0.5386 

05  =  0 

o6  =  0.1248 

In  Fig.  7,  the  time-averaged  performance  for  each  source  under  the  selected  mixing  policy,  /m, 
is  displayed.  It  can  be  seen  that  the  target  dropping  rates,  d;,  (and  therefore  dropping  probabilities) 
are  achieved  for  each  source,  i  =  1,2,3. 

As  it  is  expected  from  the  formulation  of  the  constraints,  more  than  one  solution  may  be  found. 
This  allows  for  the  incorporation  of  additional  constraints  representing  other  desirable  qualities  of 
the  policies.  Functions  of  interest  may  be  minimized  subject  to  the  constraints  presented  in  this 
paper  to  guarantee  the  achievability  of  the  resulting  policies.  For  instance,  among  all  mixing  policies 
inducing  d,  the  one  which  minimizes  the  variance  in  the  service  provided  to  certain  sources  may 


20 


be  identified.  Such  additional  objectives  will  be  pursued  in  the  future. 


9  Conclusion 

In  this  work,  the  region  of  achievable  QoS  vectors  was  precisely  determined  for  a  system  of  het¬ 
erogeneous  VBR  sources  serviced  under  policies  that  are  work-conserving  and  satisfy  the  earliest 
due  date  service  criterion  (WC-EDD).  The  QoS  requirements  for  each  application  were  defined  in 
terms  of  a  maximum  tolerable  packet  delay  and  dropping  probability.  In  addition  to  determining 
the  region  of  achievable  QoS  vectors,  a  class  of  scheduling  policies  was  developed  which  delivers 
any  achievable  performance  in  the  region  established  for  WC-EDD  policies.  The  effectiveness  of 
these  policies  in  delivering  the  QoS  vectors  was  verified  through  simulation.  Also,  an  upper  bound 
on  the  region  of  QoS  vectors  that  can  be  achieved  under  any  policy  was  determined.  Numerical 
examples  were  presented  with  simulation  results  verifying  the  theoretical  results  presented  in  this 
paper. 

10  Appendix 

Lemma  10.1  If  Xa  (Xb)  a re  generic  random  variables  representing  the  minimum  residual  traffic5 
from  sources  in  set  A(B)  C  5  in  frame  n  (as  defined  in  Lemma  3.1),  then  Xa{Xb)  satisfy  the 
following. 

A/t  +  Xb  =  Ax(n)  +  Xy(n)  +  A„(n)  +  A^,(n)  (48) 

<  A{xuu}(n)  +  A^Uu,}(n)  +  A{in„}(n)  +  A^nu,}(n) 

=  A^ubj  +  A^ng},  VA,BC.S, 

where  x  =  {.4  fl  S\ },  y  =  {/l  fl  ^2},  {u  =  B  n  5j }  and  {tn  =  B  fl  52}.  □ 

Proof.  Since  x.  v  C  Si,  X x(n)  +  Av(n)  =  A{xUy}(n)  +  A{xn„}(n)  and  therefore  (48)  holds  if, 

Xl{n)  +  XTw{n)  <  Xr{yUw}(n)  +  X\ynw}(n)  Vy,wCS2.  (49) 

As  stated  in  Corollary  3.1,  the  number  of  service  opportunities  for  the  new  arrivals,  A;(n),  i  6  52, 
(or  excess  resources)  is  independent  of  the  policy  /  G  T .  Let  X  represent  the  number  of  such 
opportunities  in  any  given  frame.  Therefore,  by  definition  in  (18),  (49)  is  rewritten  as, 

max{0,  Xy(n  —  1)  —  X}  +  max{0,  Au,(n  —  1)  -  A'}  (50) 

<  max{0,  A{yUu,}(n  -  1)  -  A'}  +  max{0,  A^n^n  -  1)  -  A'}  Vy,  w  C  S2- 

As  can  be  seen  from  (18),  equality  holds  in  (50)  when  A5j(n)  +  XTSi(n)  >  T  since  A'  =  0.  When 
Asj(n)  +  A rS2(n)  <  T  and, 

5  Residual  traffic  from  sources  in  Si  refers  to  the  traffic  that  has  a  transmission  deadline  in  the  current  frame. 
Thus,  the  minimum  residual  traffic  for  sources  in  set  Si  is  just  new  arrivals  from  sources  in  Si. 
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Case  i:  If  \yr\w  (n  —  1)  <  X  and  Ay(rc  —  1)  and/or  Xw(n  —  1)  <  X ,  then  (50)  holds6. 

Case  ii:  If  Aj,nu,(«  -  1)  >  X  =>  A y(n  -  1),  A w(n  -  1)  >  X,  then  equality  holds  in  (50). 

Case  iii:  If  Xynw(n  -  1)  <  X  and  A y(n  -  1),  A w(n  —  1)  >  X ,  then, 

Xy(n  —  1)  —  A  +  Xw(n  —  1)  —  A  -  Ay(n  —  1)  +  A  w(n  —  1)  —  2A  (51) 

=  A{yUtt,}(n  —  1)  —  X  +  (A{j,n u,y(n  —  1)  -  A  ) 

<  A^Utu}(n  -  1)  -  A +  max{0,  A^nu)}(n  -  1)  -  A'}, 

and  thus  proving  Lemma  10.1.  □ 

Lemma  10.2  bA  (defined  by  (21))  is  a  super-modular  set  function;  that  is. 

b.A  +  bB  <  b{AuB }  +  (52) 

and  equality  holds  only  if  and  only  if  A  C  B  or  B  C  A.  □ 

Proof.  Let  A 4  (A#)  be  generic  random  variables  representing  the  minimum  residual  traffic  from 
sources  in  set  A{B)  C  S  in  frame  n.  Using  Lemma  10.1,  the  following  can  be  established. 

{£[a.4  +  ab  I  a^  +  Ab  >r]  -t}p(a.4  +  as  >r)  (53) 

-  {E  [A{/luS}  +  A{/inB}  A{/4uS}  +  A{/4nB}  >  T  -  rj  P  (a^ub}  +  A^ns}  >  , 

VA.B  C  5. 

Case  (I):  Assume  A  and  B  are  disjoint  (A  fl  B  =  0),  then  (53)  is  written  as. 

{- E  [a.4  +  Xb\xa  +  Xb>t]-T}p(Xa  +  Xb>T)<  b{AuB}.  (54) 

The  following  implication  is  self-evident: 

Xa>T,\b<T 
^a  <  T,  \B  >  T 
XA  >  T,  \B  >  T 
A  a  <  T,\B  <  T,  A^4  +  A#  >  T. 

Assuming  that  all  sources  are  independent,  then  A^  and  \B  are  independent  since  sets  A  and  B 
are  disjoint,  thus, 

P(Xa  +  Xb>T)  =  P(Xa>T)P(Xb<T)  +  P(Xa<T)P(Xb>T)  (55) 

+P(Xa<T,XB<T,Xa  +  Xb>T) 
+P(Xa>T)P(Xb>T), 

6Under  Case  i,  strict  inequality  holds  if  Aj,(n  —  1),  Xw(n  —  1)  <  A'  and  X^yUw)(n  —  1)  >  A'. 
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and  therefore  (54)  may  be  written  as, 


b{AuB}  >  {E  [A.  +  Xb\xa>T,Xb<t]-T}p  (XA  >T)P (A B  <  T)  (56) 

+  { E  [XA  +  As  |  Aa  <  T,  Ab  >  T\  -  T)  P  (Xa  <T)P  (Ag  >  T) 

+  {e[xa  +  xb  |  xa>t,xb>t}-t}p(\a>t)p(Xb>t) 

+  ^E  [a.4  +  A b  XA  <  T,  Ag  <  T,  A  4  +  Ag  >  T  —  t|- 
P  (XA  <T,Xb  <  T,Xa  +  Ag  >  T) . 

Expanding  the  expected  value  operator  over  XA  and  Ag  and  rearranging  terms,  (56)  can  be  written 
as, 

b{AuB}  >  {e  [a,  \xa>t]-t}p(Xa>T)+  {e  [Ag  |  Ag  >  t]  -  t}  P  ( Ag  >  T)  (57) 
+E  [A,  |  Xa<t]p{Xa<T)P(Xb>T)  +  e[\b  |  Ag  <  T 
P(  A.4  >  T)P  (Ag  <  T)  +  TP(Xa  >  T)P(Xb  >  T) 

+  |j E  [a^i  +  Ag  XA  <  T,  Xb  <  T,  XA  +  Xb  >  T  —  T | 

P  (XA  <  T,  Xb  <  T,  XA  +  Xb  >  T) 

—  b{A)  +  &{B}  +  E 

where. 

K  =  E  [A^  |  XA  <  T]  P(Xa  <  T)P{Xb  >  T)  +  E  [Ag  |  Ag  <  t]  (58) 

P(XA  >  T)  P(XB  <  T)  +  TP(Xa  >  T)P( Ag  >  T) 

+  { E  [XA  +  Ag  |  A^  <  T,  Ag  <  T,  Xa  +  Xb>t]-  T } 

P(^a  <  Xb  <  T,\a  +  >  T). 

Notice  that  >  0  (for  the  cases  of  interest  corresponding  to  T  >  0).  Thus,  from  (57),  and  since 
b{AnB}  =  0  for  {A  fl  B }  =  0, 

t>A  +  <  b{AuB}  +  b{AnB}-  (59) 


or, 

&a  +  bs  =  4uB}  +  (60) 

where  CA'B  >  0  is  the  additional  expected  number  of  dropped  packets  as  a  result  of  the  competition 
for  resources  between  sources  in  set  A  and  B  when  {/Ini?}  =  0.  The  following  property -regarding 
CAB  is  used  to  complete  the  proof. 

Property  10.1 

c{Aux).B  _  CA,B  >  (61) 


23 


when  {A  fl  B}  =  0,  x  g  {A  U  B}  and  x  G  5.  □ 

Proof.  By  definition  of  CA,B  in  (60)  and  by  (21),  (61)  may  be  expressed  as, 


E 


max(A{.4uBux}  -  T,  0)  -  max(A{ylUl}  -  T,  0)  -  max(AB  -  T,  0)  - 


max(A{4uB}  —  T,  0)  +  max(A^  —  T,  0)  +  max(AB  -  T,  0) 


>  0. 


Consider  the  following  cases  for  each  realization. 

Case  i:  If  A4  >  T  then  by  Lemma  10.1,  (62)  holds. 

Case  ii:  If  A{4l)x}  >  T  and  .^A  <  T,  then  using  Lemma  10.1, 


^{.4uBUx}  -  ^{AuB}  -  ^{,4Ux}  +  T  >  ,\ x  -  A{4Ux}  +  T  >  -A^  +  T  >  0. 


Case  iii:  If  A{4uB}  >  T  and  A{4ux}  <  T,  then  using  Lemma  10.1, 


^{>tuSux}  —  ^  0. 

Case  iv:  If  A{4uBUx}  >  T  and  A{4uB}  <  T,  then 


A{^iuBux}  —  T  >  0. 


(62) 


(63) 


(64) 


(65) 


Therefore,  the  random  variable  operated  on  by  the  expected  value  operator  in  (62)  is  non-negative. 
Provided  any  event  that  causes  this  random  variable  to  have  a  positive  value  (as  the  ones  indicated) 
has  a  non-zero  probability  (which  is  true  for  non-degenerate  cases7),  (62)  is  satisfied,  thus  proving 
Property  10.1.  □ 

Case  (II):  Assume  the  A  and  B  are  not  disjoint  (A  C\  B  ^  0).  Bv  writing  {A  U  B }  as  the  union 
of  the  disjoint  sets  {.4}  and  {B  -  A)  and  using  the  results  from  Case  I  (60),  the  following  can  be 
written. 


b{AuB)  =  b{Au{B-A]  ~  1>A  +  b{B-A)  +  C^A^'^B  A *  (66) 

Similarly,  by  writing  set  B  as  the  union  of  the  disjoint  sets  {A  fl  B}  and  {B  —  A},  the  following 
can  be  written. 


bB  =  ^{.4nB}  +  b{B-A}  + 

or, 

b{B-A}  =  f>B  -  b{AnB}  -  C^AnB }’(B~AK 
Finally,  by  combining  (66)  and  (68),  the  following  is  obtained, 
b{AuB}  +  ^HB)  =  bA  +  bB  + 

If  A  C  B  then, 

c{A}.{B-A)  _  c{AnB},{B-A}  _  C{A),{B-A)  _  C{A},{B-A}  _  q 

'A  degenerate  case  is  when  there  is  no  packet  dropping  in  the  system,  that  is  bA  =  bs  =  b{AuB}  =  0- 


(67) 


(68) 


(69) 


(70) 
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Similarly  if  B  C  A.  If  A  <f_  B,  then  {Ar\  B)  C  A  and  by  Property  10.1, 


C{A},{B-A}  _  C{AnB},{B-A}  >  Q 
completing  the  proof  of  the  lemma.  □ 


(71) 


Lemma  10.3  The  sets  gj,  j  =  1,2 that  satisfy  equation  (29),  defining  the  vertices  of  the 
polytope  Vu ,  must  be  strictly  included  in  each  other;  that  is 

either  h3  =  g3  -  (g3  PI  gk)  =  0  or  /i*  =  gk  -  (s*  H  #)  =  0,  (72) 

for  all  gj  and  g m  □ 

Proof.  Let  and  g2  be  two  sets  satisfying  (29).  By  adding  the  corresponding  equations  the 
following  is  obtained. 

b{g\}  +  ^{g2}  —  22  “b  22<  =  22/  +  22  (73) 

l^9 1  i£92  i£{g\Ug2}  ^{51^2} 

or. 

E  di  =  b{9i)  +  b{92)  -  E  d'i-  (74) 

*€{5iU^2}  ^{91^92} 

Since  for  any  subset  of  5  (26)  must  be  satisfied,  then  for  {gi  n  g2},  Xie{Pln52}  d*i  -  &{5,ng2}>  the 
following  is  obtained  from  (74), 

E  d1  ^  6{pi)  +  b{92)  ~  6{pinfl2}-  (75) 

‘€{inut/2} 

Suppose  now  that  gi  and  g2  are  not  strictly  included  in  each  other.  That  is  hi  =  gi  -  (gi  D  g2) 
and  h2  -  g2  -  (g j  fl  52)  are  both  non-empty.  Then,  Lemma  10.2  implies  that, 

bh> >  +  b{32}  <  6{<nuS2}  +  b{ging2}  (76) 

or, 

^{51}  ”b  ^{92}  “  b{gir\g2}  ^  ^{g\Og2}'  (77) 

Therefore. 

22  <  b{gi}  +  &{P2}  —  ^{5ln52}  ^  ^{51^52}  (78) 

i€{9iVg2} 

or, 

E  d‘  <  b{9< u*>,  (79) 

i€{siU32} 

which  implies  d*  ^  Vu ,  this  is  a  contradiction  since  d*  is  a  vertex  of  Vu .  Thus  the  assumption  that 
gi  are  strictly  included  in  each  other  follows.  □ 
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Abstract 

Typical  rate-based  traffic  management  schemes  for  real-time  applications  attempt  to  allocate 
resources  by  controlling  the  packet  delivery  to  the  resource  arbitrator  (scheduler).  This  control 
is  typically  based  only  on  the  characteristics  of  the  particular  (tagged)  traffic  stream  and  would 
fail  to  optimally  adjust  to  non-nominal  network  conditions  such  as  overload.  In  this  paper,  a 
dynamic  Regulation  and  Scheduling  (dynamic-R&S)  scheme  is  proposed  whose  regulation  function 
is  modulated  by  both  the  tagged  stream’s  characteristics  and  some  information  capturing  the  state 
of  the  co-existing  applications  as  provided  by  the  scheduler.  The  performance  of  the  proposed 
scheme  -  as  well  as  the  equivalent  static  one  -  is  investigated  under  both  underload  and  overload 
traffic  conditions  and  the  substantially  better  throughput  /  jitter  characteristics  of  the  dynamic-R&S 
scheme  are  established. 
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1  Introduction 


A  secure  solution  to  the  problem  of  guaranteeing  the  QoS  of  real-time  applications  will  typically  require 
the  reservation  of  the  maximum  amount  of  needed  resources.  Because  of  the  anticipated  low  -  due  to 
resource  requirement  fluctuation  -  network  utilization,  alternative  solutions  are  being  considered  based 
on  “over  allocation”  of  resources  to  a  group  of  applications  (  multiplexing).  Over  allocation  aims  at 
improving  the  resource  utilization,  at  the  expense  of  failing  to  accommodate  at  certain  times  peak 
resource  demands  from  a  number  of  the  grouped  applications.  Or,  each  application  cannot  expect  to 
always  be  provided  resources  to  accommodate  its  current  demand.  Grouping  of  applications  allows 
for  allocation  of  the  resources  to  the  participating  applications  without  reserving  them  by  individual 
applications  and  potentially  wasting  them  if  not  used  by  this  application.  Grouping  of  applications 
and  “over-allocation”  of  resources  are  the  key  aspects  of  non-degenerate  statistical  multiplexing. 

A  basic  -  and  widely  considered  -  statistical  multiplexing  scheme  is  the  one  which  buffers  in¬ 
coming  packets  (or  cells)  in  a  First-Come,  First-Stored  fashion  and  schedules  them  in  a  First-Come, 
First-Served  fashion.  This  basic  scheme  -  or  simple  variations  to  it  -  have  been  adopted  and  studied 
extensively  in  traditional  data  networking  environments.  Because  of  the  stringent  QoS  requirements  of 
real-time  applications,  it  is  expected  that  such  “traditional”  statistical  multiplexing  schemes  will  not 
be  effective  for  such  applications.  It  is  well  understood  that  some  tighter  control  should  be  exercised 
on  input,  output  as  well  as  in  the  internal  processes  of  a  multiplexing  scheme  which  impact  on  its 
efficiency.  Through  (a)  sophisticated  call  admission  schemes  or  other  types  of  “weak”  resource  reser¬ 
vation  at  the  larger  time  scale  and  (b)  proper  traffic  regulation  and  service  scheduling  mechanisms  at 
the  smaller  time-scale,  statistical  multiplexing  will  potentially  provide  for  increased  network  utilization 
while  delivering  the  more  stringent  QoS  associated  with  real-time  applications.  This  paper  is  focused 
on  the  control  approaches  at  the  smaller  time  scale,  namely  regulation  and  scheduling. 

Typical  QoS  metrics  shaped  by  regulation  and  scheduling  schemes  are  described  in  terms  of  the 
induced  cell  loss/delay  and  the  allocated  bandwidth.  In  principle,  a  target  value  of  such  QoS  metric 
may  be  possible  to  achieve  through  either  tight  traffic  regulation  (typically  referred  to  as  rate-based 
approach),  or  sophisticated  scheduling  (typically  referred  to  as  scheduler-based  approach)  only.  In  most 
practical  cases  though,  some  scheduling  will  be  needed  to  resolve  transmission  conflicts  among  rate- 
based  controlled  applications.  Similarly,  some  traffic  filtering  (regulation)  will  be  needed  to  eliminate 
extreme  traffic  realization  which  would  be  hard  to  manage  even  by  a  sophisticated  scheduler  under 
a  scheduler-based  approach.  In  addition  to  the  supporting  role  of  regulation  in  a  scheduler-based 
approach,  and  scheduling  in  a  rate-based  approach,  higher  multiplexing  gain  may  be  achieved  by 
allowing  for  some  cooperation  between  the  two  functionalities  which  will  typically  co-exist  and  be 
designed  to  achieve  a  common  goal.  Such  a  dynamic  policy  is  investigated  in  the  present  paper. 

Substantial  effort  has  been  directed  toward  the  development  of  regulation  and  scheduling  schemes 
for  real-time  applications.  Rate-based  schemes  attempt  to  provide  for  the  target  QoS  metric  by  con¬ 
trolling  the  bandwidth  demands  of  the  supported  applications  through  rate  regulation;  some  simple 
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(static)  priority  FCFS  scheduling  may  be  employed  for  transmission  conflict  resolution.  Scheduler- 
based  schemes  typically  associate  deadlines  with  cells  and  attempt  to  provide  for  the  target  QoS  metric 
by  scheduling  service  in  order  of  increasing  deadlines.  Examples  of  regulation  and  scheduling  schemes 
for  real-time  applications  include  :  D-EDD  (Delay-Earliest  Due  Date)  [1];  J-EDD  (Jitter  Earliest  Due 
Date)  [2],  [3];  HRR  (Hierarchical  Round  Robin)  [4];  S&G  (Stop  and  Go  Queuing)  [5];  WFQ  (Weighted 
Fair  Queuing)  [6];  PGPS  (Packet  Generalized  Processor  Sharing)  [7];  RCSP  (Rate  Controlled  Static 
Priority)  [8];  LIT  (Leave  In  Time)  [9];  MRTS  (Multi  Rate  Traffic  Shaping)  [10];  VC  (Virtual  Clock) 
[11], 

The  traffic  management  scheme  for  real-time  applications  investigated  in  this  work  may  be  viewed 
as  an  enhancement  of  the  Rate  Controlled  Static  Priority  (RCSP)  scheme  proposed  in  [8].  Under 
the  RCSP  scheme  each  traffic  stream  passes  through  a  regulator  which  restores  the  traffic,  completely 
or  partially,  based  on  the  traffic  description  and  the  type  of  regulator  used.  The  restored  traffic 
is  handed  over  to  the  respective  priority  queue  and  is  scheduled  in  FCFS  order.  The  rate  jitter 
regulator  employed  in  [8]  is  based  on  cell  eligibility  times  (ET)  defined  as  follows:  ET\  =  AT\\  ET \  = 
max{ETk-\  +  T- f  r*,  AT/J,  k  >  1  where  AT  denotes  the  cell  arrival  time,  and  r*  is  a  term  used  to 
provide  the  average  rate;  subscripts  indicate  packets.  Rate  jitter  is  controlled  with  respect  to  the  ET 
of  the  previous  packet  of  the  same  connection.  T  is  the  minimum  cell  inter  arrival  time  specified  by 
the  source.  The  idea  is  to  hold  cells  so  that  minimum  inter  departure  time  be  enforced. 

An  apparent  drawback  of  schemes  such  as  the  RCSP  is  that  the  regulator  and  scheduling  functions 
are  separated.  It  is  expected  though  that  the  throughput  /  jitter  of  a  regulated  tagged  stream  will 
potentially  be  substantially  modulated  at  the  scheduler  by  the  cumulative  activity  of  the  co-existing 
traffic  streams.  As  a  consequence  the  effectiveness  of  the  Regulation  and  Scheduling  (R&S)  scheme  may 
be  compromised  significantly.  This  problem  can  be  addressed  to  some  extent  by  dynamically  adjusting 
the  regulator  behavior  based  on  state  information  fed  back  from  the  scheduler  to  the  regulator. 

In  section  2  the  proposed  dynamic  R&S  (dynamic-R&S)  policy  is  motivated  and  presented,  along 
with  the  equivalent  static  R&S  (static-R&S)  policy  on  which  a  comparative  study  will  be  based.  In 
section  3  the  behavior  of  the  tagged  regulator  is  investigated  under  both  policies.  In  section  4,  the 
behavior  of  the  scheduler  is  studied  under  underload  conditions  at  the  scheduler.  The  tagged  cell  in¬ 
terdeparture  process  at  the  scheduler  is  derived  by  formulating  and  analyzing  an  interesting  queueing 
system  in  which  arrivals  depend  on  the  queue  occupancy.  Based  on  this  process,  the  throughput  / 
jitter  performance  of  the  policies  is  evaluated.  In  section  5  the  throughput  /  jitter  performance  of  the 
policies  is  investigated  under  overload  conditions  at  the  scheduler.  Numerical  results  illustrating  the 
performance  of  the  dynamic-R&S  and  static-R&S  policies  under  underload  and  overload  conditions  at 
the  scheduler  are  presented  in  section  C.  These  results  are  based  on  the  analytical  studies  presented 
earlier  in  the  preceding  sections  and  are  in  accordance  with  expectations.  Finally,  some  of  the  as¬ 
sumptions  made  in  order  to  facilitate  the  analysis  are  relaxed  in  section  7  where  simulation  results  are 
presented  involving  multiple  sources  subject  to  the  R&S  policies  considered  in  this  paper. 
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2  The  Proposed  Dynamic  R&S  Policy 


The  typical  primary  objective  in  regulating  real-time  traffic  stream  within  the  network  is  to  control 
jitter  or  the  instantaneous  rate  (throughput).  This  is  achieved  in  the  RCSP  mechanism  [8]  by  enforc¬ 
ing  a  minimum  spacing  at  the  output  of  the  regulator  associated  with  the  traffic  stream  of  interest 
(tagged  traffic  stream).  Fig.  1  (without  the  feedback  loop)  shows  a  block  diagram  of  an  architecture 
implementing  a  RCSP  mechanism  where  each  of  the  N  multiplexed  streams  is  regulated  before  it  is 
considered  for  transmission. 


Regulator  Block 


Figure  1:  The  dynamic- R&S  (including  the  arrow)  and  the  static-R&S  (excluding  the  arrow)  Systems 

Since  scheduling  conflicts  will  arise  when  more  than  one  regulated  applications  are  present,  a 
scheduler  needs  to  be  employed  to  resolve  these  conflicts.  A  consequence  of  the  scheduling  conflicts  is 
that  the  tagged  traffic  stream  at  the  output  of  the  scheduler  will  be  a  distorted  version  of  the  target 
stream  enforced  at  the  output  of  the  regulator.  For  instance,  although  a  minimum  spacing  between 
consecutive  tagged  cells  is  enforced  at  the  output  of  the  tagged  regulator  in  Fig.  1,  this  does  not 
hold  true  for  the  tagged  stream  at  the  output  of  the  scheduler.  This  clustering  is  generated  due  to  an 
increased  arrival  rate  to  the  scheduler  in  the  immediate  past  which  has  pushed  back  (delayed)  earlier 
tagged  cells.  Due  to  the  latter,  some  spreading  followed  by  some  clustering  of  tagged  cells  is  expected 
to  be  observed  at  the  output  of  the  scheduler. 

The  tagged  cell  spreading  mentioned  above  can  be  reduced  by  monitoring  the  scheduler  and  re¬ 
leasing  a  tagged  cell  before  its  eligibility  time  1  when  scheduler  queue  build  ups,  which  will  cause  the 
spreading,  are  detected.  The  dynamic  Regulation  and  Scheduling  (dynamic-R&S)  scheme  proposed 
below  attempts  to  provide  for  a  smoother  tagged  traffic  at  the  output  of  the  scheduler  based  on  this 

^ere  defined  as  T  time  units  following  the  previous  tagged  cell  release,  if  a  minimum  spacing  of  T  is  targeted. 
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idea. 

Although  less  commonly  stated,  another  objective  in  regulating  real-time  traffic  streams  within 
the  network  is  to  control  (limit)  the  amount  of  bandwidth  that  is  demanded  by  traffic  streams.  This  is 
a  mechanism  to  protect  other  real-time  applications  from  excessive  bandwidth  consumption  by  “mis¬ 
behaving”  ones.  This  “misbehavior”  is  typically  due  to  the  traffic  fluctuations  (burstiness).  Although 
severe  traffic  fluctuations  will  be  eliminated  by  the  regulator  (s),  some  traffic  burstiness  will  remain 
since  the  regulator(s)  will  not  typically  attempt  to  transform  Variable  Bit  Rate  (VBR)  to  Continuous 
Bit  Rate  (CBR)  traffic,  due  to  the  delay  constraints.  For  instance,  remaining  burstiness  is  present  in 
a  traffic  stream  regulated  by  a  leaky  bucket,  or  when  high  multiplexing  gain  is  to  be  achieved.  Such 
controlled  burstiness  will  potentially  be  “absorbed”  in  part  by  the  multiplexing  process. 

When  the  arrival  rate  to  the  scheduler  increases  due  to  (residual)  traffic  burstiness,  spreading  of 
the  tagged  cells  at  the  output  of  the  scheduler  is  expected  to  be  observed,  as  indicated  again  above. 
This  spreading  represents  an  instantaneous  reduction  in  the  bandwidth  allocated  to  the  tagged  traffic 
stream,  as  measured  at  the  output  of  the  scheduler.  In  the  context  of  the  bandwidth  availability  to 
the  tagged  traffic  stream,  the  dynamic-R&S  scheme  proposed  below  may  be  viewed  as  attempting  to 
provide  for  a  constant  bandwidth  availability  to  the  tagged  traffic  stream  at  periods  of  excessive  total 
bandwidth  demand  from  the  co-existing  applications. 

A  simple  architecture  of  the  switch  for  the  illustration  of  the  proposed  policy  is  shown  in  Fig. 
1.  Similarly  to  the  architecture  proposed  in  [8],  each  of  the  N  supported  real-time  applications  is 
regulated  at  a  logically  dedicated  regulator  before  it  is  delivered  to  the  scheduler.  In  the  present 
work,  a  simple  FCFS  scheduling  policy  is  being  considered.  This  scheduling  mechanism  is  the  simplest 
possible,  reducing  the  scheduling  complexity  to  single  queue  buffering. 

Under  the  dynamic-R&S  policy  proposed  below,  the  regulation  process  is  modulated  by  some 
scheduler  status  information.  Unlike  in  past  work  in  the  area,  appropriate  information  regarding  the 
status  of  the  scheduler  (FCFS  queue)  is  fed  back  to  the  regulators,  as  indicated  in  Fig.  1  with  the 
feedback  arrow.  As  explained  below  the  time  of  delivery  of  the  cell  from  regulator  i  to  the  scheduler 
initiates  a  cycle  of  scheduler  status  monitoring  for  regulator  i\  this  cycle  is  completed  with  the  delivery 
of  the  next  cell  from  regulator  i  to  the  scheduler,  initiating  the  next  monitoring  cycle.  The  time  at 
which  a  tagged  cell  is  shifted  from  the  regulator  to  the  scheduler  is  determined  by  the  tagged  cell  release 
policy  described  below. 

2.1  The  tagged  cell  release  policy:  dynamic-R&S  scheme 

Let  tk  denote  the  time  slot  at  which  the  kth  tagged  cell  is  released  from  the  tagged  regulator.  Let  Qrk 
denote  the  queue  occupancy  at  the  regulator  upon  (following)  the  release  of  the  kth  cell.  Let  t k  +  B k 
(Bk  >  1)  denote  the  time  slot  at  which  the  cumulative  number  of  non-tagged  arrivals  (releases)  to 
the  scheduler  following  tk  exceeds  T  —  2  for  the  first  time.  Let  a  superscript  d  (s)  indicate  a  quantity 


4 


associated  with  the  dynamic-R&S  (static-R&S)  policy  and  let 


Wk=min{Bk,T}  or  W  =  min{B,T}, 


(1) 


where  the  last  expression  involves  the  generic  random  variables  W  and  B.  The  ( k  4-  l)st  tagged  cell 
release  time  4+1  is  given  by 


tk+l  =  tk  +  Wk  +  Hdk*  1 


{Ql=0,AV  *=0}’ 


(2) 


where  Hf.  denotes  the  time  interval  between  h  =  tk  +  Wk  and  the  first  tagged  cell  arrival  following 
tk;  T  is  a  constant  positive  integer;  A f’i  is  the  number  of  cell  arrivals  to  the  dynamic  regulator  over  j 
slots. 

If  T  is  equal  to  the  minimum  spacing  among  consecutive  tagged  cell  releases  from  the  regulator 
in  the  RCSP  scheme  [8],  then  it  is  easy  to  see  that  the  above  release  policy  will  accelerate  the  tagged 
cell  releases  from  the  regulator  at  times  when  a  minimum  spacing  of  T  at  the  output  of  the  scheduler 
would  be  violated.  This  acceleration  occurs  when  Bk  <  T. 

It  is  expected  that  the  tagged  cell  release  acceleration  will  have  a  positive  impact  on  the  tagged 
cell  delay  jitter  and  availed  bandwidth.  To  quantify  such  benefits  the  static-R&S  scheme  is  considered 
in  parallel  in  the  rest  of  the  paper.  As  described  below  its  tagged  cell  release  policy  is  not  modulated 
by  any  scheduler  status  information.  A  simple  FCFS  scheduler  is  also  considered. 


2.2  The  tagged  cell  release  policy:  static-R&S  scheme 

By  employing  the  definitions  presented  above  and  replacing  Wk  =  min{BktT}  by  T,  the  ( k  4-  l)st 
tagged  cell  release  time  tk+\  is  given  by: 

4+i  =tk  +  T  +  Hk*  l{Qr=o,+;  r^o} 

where  Hk  denotes  the  time  interval  between  Tk  =  tk  +  T  and  the  first  tagged  cell  arrival  following  tk\ 
ApJ  is  the  number  of  cell  arrivals  to  the  static  regulator  over  j  slots. 


3  Study  of  Regulator  Behavior 

The  behavior  of  the  R&S  schemes  is  evaluated  by  investigating  their  impact  on  a  specific  stream  (tagged 
stream).  The  traffic  at  the  output  of  the  regulators  associated  with  the  remaining  N  —  1  applications  is 
aggregated  and  forms  the  background  traffic  which  competes  with  the  tagged  traffic  for  resources  at  the 
scheduler.  Let  Ak  denote  the  number  of  background  cells  delivered  to  the  scheduler  over  k  consecutive 
slots;  let  Aj  denote  the  number  of  background  cells  delivered  to  the  scheduler  in  the  ith  slot  (Note  that 
Ak  =  £i=i  Ai). 

The  improved  performance  of  the  dynamic-RfcS  policy  over  the  static-R&S  policy  will  be  estab¬ 
lished  by  evaluating  the  cell  interdeparture  process  at  the  output  of  the  scheduler.  Since  the  input 
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process  to  the  scheduler  is  the  output  process  from  the  regulator,  it  is  important  that  the  latter  be 
investigated  to  both  gain  insight  into  the  combined  system  (regulator  plus  scheduler)  behavior  as  well 
as  evaluate  the  output  process  at  the  scheduler. 

The  basic  operational  difference  between  the  dynamic-R&S  and  static-R&S  schemes  is  captured 
by  the  tagged  cell  interdeparture  process  from  the  regulator  {14}fc>i,  where  14  =  4+ 1  —  4-  In  view  of 
(2)  and  (3)  it  is  easy  to  establish  that  the  evolution  of  the  tagged  cell  process  {Vk}k>\  is  described  by 

Vj*  —  T  +  Hsk*  l{Qr_0  4».t=0}  (static-R&S  scheme)  (4) 

Vk  =Wk  +  H%*  l{Q,=Md,w*=0}  (dynamic-R&S  scheme)  (5) 

In  order  to  decouple  the  intrinsic  behavior  -  to  be  investigated  in  this  paper  -  of  the  R&S  schemes 
from  the  source  load,  the  heavy  traffic  source  assumption  will  be  made  throughout  the  paper.  This 
assumption  is  consistent  with  standard  ones  made  in  order  to  determine  the  throughput  capabilities 
of  a  scheme  as  well  as  the  throughput  fluctuations  (jitter),  without  the  noise  introduced  by  source 
inactivity  periods.  Under  the  heavy  traffic  source  assumption  the  indicator  function  in  (4)  and  (5)  is 
always  zero  and,  thus, 


h 

II 

5? 

(static-R&S) 

(6) 

£ 

ii 

(dynamic-R&S) 

(7) 

The  following  proposition  describes  the  generic  random  variable  W. 

Proposition  1  :  The  probability  mass  function  of  W  and  its  mean  are  given  by: 

Pr{W  =  j}  —  Fj-\(T  —  2)  —  Fj{T  —  2)  ,  l<i<T  (8) 

E{W)  =  YtFj.x{T-2)  (9) 

where  F0{T  -  2)  d=  1,  Ft{T  -  2)  d=  0  and  Fj(T  -  2)  d=  Fj{T  -  2),  1  <j<  T,  where  Fj(-)  denotes  the 
j-fold  convolution  of  the  probability  distribution  function  of  A1, 

The  proof  of  this  proposition  along  with  the  proofs  of  other  propositions  and  corollaries  that  follow 
may  be  found  in  the  Appendix. 

Proposition  1  describes  the  regulator  interdeparture  process  of  the  dynamic-R&S  scheme  in  terms 
of  the  first  order  probability  mass  function  and  the  first  moment.  This  process  will  be  employed  in 
the  study  of  the  scheduler  under  the  dynamic-R&S  scheme.  Its  description  is  also  employed  in  the 
following  comparative  study  of  the  two  policies. 


6 


Proposition  2  ; 

policies  is  given  by 


The  maximum  throughput  (output)  rate  of  the  tagged  regulator  under  the  two 


ftS 

Ixmax 


1 

T 


(10) 


Ri 


1  _  1 
E{W}  ~  J2j=i  Ej-i{T  —  2) 


(11) 


Notice  that  R^ax  >  Rsmax  with  equality  only  when  the  background  traffic  process  can  never  deliver  more 
than  T  —  2  cells  over  T  —  1  consecutive  slots  (typically,  a  zero  probability  event). 


The  above  discussion  establishes  that  the  dynamic-R&S  scheme  will  respond  to  a  sudden  increase 
of  the  background  load  2  by  increasing  its  rate  above  the  targeted  rate  of  1/T,  in  an  effort  to  ensure 
that  the  targeted  tagged  rate  at  the  output  of  the  scheduler  is  achieved.  The  impact  of  such  a  reaction 
(which  is  not  possible  under  the  static-R&S  scheme)  on  the  scheduler  output  process  is  investigated  in. 
the  next  section. 


4  Study  of  Scheduler  Behavior  under  Underload  Conditions 

In  this  section,  the  tagged  cell  interdeparture  process  from  the  scheduler  is  derived  in  order  to  evaluate 
the  throughput  /  jitter  properties  of  the  R&S  schemes.  Since  the  deliverability  of  the  QoS  of  the 
tagged  application  (which  is  considered  to  be  a  measure  of  the  induced  throughput  /  jitter)  is  expected 
to  be  decreased  under  high  load  conditions  at  the  scheduler,  the  latter  will  be  studied  under  high 
load  conditions  both  below  (underload)  and  above  (overload)  the  scheduler  capacity.  Although  the 
call  admission  control  function  will  attempt  to  minimize  the  occurrence  of  temporary  overload  at  the 
network  nodes,  due  to  traffic  burstiness  and  the  desire  to  achieve  high  resource  utilization  through 
statistical  multiplexing,  it  is  expected  that  temporary  overload  will  be  unavoidable.  It  is  under  such 
conditions  that  traffic  management  mechanisms  should  not  only  not  collapse,  but  rather  minimize  the 
impact  of  the  overload  on  the  QoS.  In  order  to  evaluate  the  tagged  output  process  at  the  scheduler  as 
shaped  by  the  R&S  policy  as  opposed  to  buffer  overflows,  sufficiently  large  buffer  will  be  assumed  to  be 
available  at  the  scheduler  throughout  this  paper,  to  eliminate  the  interference  from  the  buffer  overflow 
process. 

Under  underload  conditions  (p  <  1)  the  scheduler  queue  is  stable  and  since  no  cell  overflow  is 
possible  the  following  relationships  between  the  maximum  regulator  ( Rmax )  and  scheduler  ( Smnx ) 
throughputs  hold. 

Rdmax  =  Sdmax  and  Rsmax  =  Ssmax  (12) 

In  view  of  the  above  and  Proposition  2  the  following  corollary  is  self  evident. 

2which  would  result  in  an  instantaneous  reduction  of  the  tagged  throughput  at  the  output  of  the  scheduler 
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Corollary  1  :  5^ax  ^  ^max*  where  5mnx  denotes  the  maximum  tagged  cell  output  rate  from  the 

scheduler ,  or  the  maximum  tagged  cell  throughput. 

Corollary  1  implies  that  potentially  higher  throughput  will  be  achieved  by  the  tagged  applica¬ 
tion  under  the  dynamic-R&S  policy  compared  to  that  under  the  static-R&S  policy.  This  increase  in 
throughput  is  due  to  a  regulator  tagged  interdeparture  interval  W  less  than  T  under  the  dynamic-R&S 
policy,  occurring  only  if  the  tagged  cell  interdeparture  from  the  scheduler  is  to  exceed  the  tagged  value 
of  T.  Since  the  R&S  policy  attempts  to  minimize  the  variation  of  the  target  spacing  T  between  con¬ 
secutive  tagged  cell  departures  (or  minimize  the  deviation  of  the  instantaneous  tagged  cell  output  rate 
from  1/T),  the  deviation  of  the  tagged  cell  interdeparture  interval  from  the  target  T  will  be  employed 
as  a  jitter  metric.  The  precise  study  of  the  tagged  interdeparture  process  at  the  output  of  the  scheduler 
-  denoted  by  {Xk}k>i  -  is  presented  in  the  next  subsections  under  both  policies. 

To  avoid  unnecessary  complications  as  well  as  evaluate  the  performance  of  the  policies  under 
non-idling  environment  -  in  which  case  they  differ  -  the  heavy  traffic  source  assumption  is  employed 
here  for  the  tagged  traffic  at  the  input  of  the  regulator,  as  indicated  earlier.  That  is,  the  tagged  cell 
release  to  the  scheduler  process  will  be  dependent  only  on  the  policy  and  the  scheduler  state  and  not 
on  the  regulator  queue  occupancy;  the  latter  is  considered  to  be  non-empty  under  the  heavy  traffic 
assumption. 

4.1  Tagged  cell  interdeparture  under  the  dynamic-R&S  policy 

A  queueing  model  for  the  system  of  the  tagged  traffic  regulator  and  scheduler  is  shown  in  Fig.  2; 
the  regulator  (CR)  and  scheduler  ( Cs )  buffer  capacities  are  assumed  to  be  theoretically  infinite  as 
mentioned  earlier.  As  indicated  earlier,  the  heavy  traffic  assumption  implies  the  regulator  will  not 
starve  and  thus  Vk  depends  only  on  Wk  —  min{Bk,T},  where  Bk  is  the  shortest  time  interval  (in  slots), 
initiated  upon  the  release  of  the  kth  tagged  cell  and  terminated  at  the  slot  in  which  the  cumulative  non 
tagged  (or  background)  arrivals  over  this  interval  exceed  T  -  2. 

As  implied  in  Fig.  2,  N  —  1  traffic  streams  -  potentially  controlled  by  a  similar  R&S  scheme  - 
are  assumed  to  compete  for  the  common,  slotted  transmission  resource.  For  analysis  tractability,  the 
cumulative  arrivals  from  the  N  —  1  co-existing  traffic  streams  -  forming  the  background  traffic  -  are 
assumed  to  be  independent  (per  slot)  and  identically  distributed,  represented  by  the  random  variable 
A1  (see  earlier);  the  maximum  number  of  background  arrivals  per  slot  is  equal  to  N-l. 

Fig.  3  illustrates  the  sequence  of  events  associated  with  a  slot,  as  considered  in  this  study,  and 
presents  other  time-related  conventions.  A  cell  departure  (if  any)  is  assumed  to  occur  first,  followed 
by  the  cumulative  background  arrivals  over  this  slot  (if  any)  and  then  the  tagged  cell  arrival  (if  any). 
Xk  —  1  slots  are  available  to  the  background  traffic  between  two  consecutive  tagged  cell  departures  with 
interdeparture  interval  equal  to  Xk . 

The  scheduler  interdeparture  time  between  tagged  cells  k  and  k  +  1,  Xk ,  can  be  determined  based 
on  the  scheduler  queue  occupancy,  Qk ,  found  upon  arrival  of  the  tagged  cell  k  to  the  scheduler  queue 
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Figure  2:  Queuing  model  for  the  tagged  traffic  study  at  the  scheduler 


(not  counting  itself).  This  is  described  next.  Later,  the  stationary  probabilities  of  Qk  are  derived  to 
complete  the  calculation  of  the  probability  distribution  of  AV 
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Figure  3:  Sequence  of  events  in  the  scheduler  output  slot  m  :  (1)  cell  departure;  (2)  background  cell 
arrivals;  (3)  tagged  cell  arrival 


A  rarely  encountered  peculiarity  of  the  scheduler  queue  is  that  tagged  cell  arrivals  depend  on  the 
queue  build  up,  or,  the  queue’s  arrival  process  depends  on  its  occupancy  process.  Specifically,  tagged 
cell  arrival  k  +  1  occurs  T  slots  following  the  arrival  of  tagged  cell  k  if  and  only  if  the  cumulative 
background  arrivals  following  (“behind”)  the  tagged  cell  k  arrival  have  not  exceeded  T  —  1  at  any 
earlier  slot.  Otherwise,  tagged  cell  k  +  1  will  arrive  (be  released  from  the  regulator)  at  that  earlier  slot. 
The  tagged  cell  k  will  be  served  in  (during)  the  (Qk  +  l)st  slot  following  its  release  from  the  regulator. 
The  following  proposition  provides  for  the  conditional  value  of  Xk  given  Q*. 

Proposition  3  The  conditional  probability  of  Xk  given  Qk ,  Pr{Xk  =  T  +  l/Qk  —  i}  for  —{T  —  1)  < 
/  <  (TV  —  2),  is  given  by  the  following  expressions: 

Case  I :  l  >  0  (no  clustering) 

min{i+l,T} 

Pr{Xk  =  T  +  l/Qk  =  i}  =  Y,  Pr{Am  =  T  +  1  —  1,  Am~x  <  T  —  2}  + 

m=  1 
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(13) 


T-2 

+  l{t<T-2}  *  P r =  m,  Xk  =  T  4-  1} 

m— 0 

Case  II :  l  <  0  (clustering) 

Pr{Xk  =  T  +  l/Qk=i}  =  l{i>r_1}*PrMr  =  T  +  /-l}  + 

T-2 

+l{i<r-2}  *  E  M^<+1  =  m,  =  T  +  /}  (14) 

771=0 

where  the  probabilities  involved  in  the  above  expressions  are  derived  and  evaluated  in  the  proof  of  this 
proposition.  □ 

The  transition  probabilities  of  the  scheduler  occupancy  process  {Qk}k>i  are  given  in  the  following 
proposition. 

Proposition  4  The  transition  probabilities  of  the  Markov  Process  {Qk},  embedded  upon  tagged  cell 
arrival  times,  are  given  by  the  following  expressions  : 

Case  I :  i  >T  —  1 

Pr{Qk+i  =  j/Qk  =  i}  = 

=  Pr{Al  =  j  —  i}  *  l{T-i<j-t<A'-i}n{T-i<yv-i}  + 

7-1 

+  E  Pr{Am  =  j— i  —  1+m,  Am~X  <T  —  2}  *  l{T-m<j-i<T-2+N-m}n{T-l<m*(N-l)}  + 
m= 2 

+  Pr{AT  —  j  —  i  —  1+T}*  1{— (T— i)<j-j<Ar— 2} 

Case  II :  i  <  T  —  2 

Pr{Qk+ 1  =  j/Qfc  =  *}  = 

Pr{Al  =  j  —  i}  *  l{T-i<j-t<JV-i}n{T-i<N-i}  + 

j+i 

+  E  =  —  l+^l,  j4m-1  <  T  —  2}  *  1{T —m<j—i<T  —2+N—m}n{T — l<m*(Af— 1)}  + 

m= 2 

T-2  T—i—2  T-3+jV-m 

+  E  E  E  Pr"{i(n)  =  an,Q{n)  =  j/i(0)  =  0,Q(0)  =  m} 

771  =  0  71=1  an=T- 1-771 

*  Pr{./4,'+1  =  m,  i4*  <  T  —  2}  + 

T-2  T-3+7V-m 

+  E  E  Prr-,-1{i(T-i-l)=a,Q(T-i-l)  =  i/i(0)  =  0,Q(0)=m}* 

771=0  a=0 

*  Pr{Ai+l  =m,Ai  <T  -  2}D  (15) 

Finally  the  tagged  cell  interdeparture  probability  distribution  can  be  obtained  from  the  conditional 
ones  (Proposition  3)  and  the  stationary  probabilities  of  {Qk},  Kq(i),  derived  by  employing  the  transition 
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probabilities  given  in  Proposition  4.  Thus, 

oo 

Pr{Xk  =  T  +  0  =  E  Prixk  =  T  +  VQk  =  *}  *  **(*) 

z=0 

T- 2 

=  E  =  T  +  l/Qk  =  i)  *  + 

i=0 

+[1  -  E  *  PriX k  =T  +  l/Qk  >T-  1}  (16) 

1=0 

Since  the  conditional  interdeparture  given  Qk  =  ifori>  T- 1  is  constant,  independent  from  i  (see 
Proposition  3),  equation  (16)  suggests  that  since  only  the  stationary  probabilities  nq(i)  for  i  <  T  -  2 
are  used,  truncating  {Qk}k>i  to  some  state  T  -  2  +  L,  for  some  large  L,  will  cause  negligible  impact 
on  the  7 Tq(i)  for  i  <  T  —  2  since  only  values  away  from  the  truncation  boundary  are  used  in  (16). 

4.2  Tagged  cell  interdeparture  under  the  static-R&S  policy 

Under  the  static-R&S  policy,  no  early  tagged  cell  releases  are  allowed,  and  therefore  the  scheduler  is 
fed  by  a  periodic  tagged  traffic  of  period  T  (heavy  traffic  assumption  at  the  regulator). 

Proposition  5  :  The  probability  distribution  of  the  jitter  for  the  static-R&S  policy  is  given  by: 

OO 

Pr{Xk  =  T  +  1}  =  E  Pr{Qk+i  =  i  +  l/Qk  =  t}  *  nq(i)  (17) 

l=i 

The  transition  probabilities  Pr{Qk+\  =  j/Qk  =  *}  can  be  obtained  as  the  T-step  transition 
probabilities  of  a  simple  M/D/1  queue.  Finally  by  employing  the  transition  probabilities,  the  stationary 
distribution  rrq{i)  can  be  obtained. 

5  Study  of  Scheduler  Behavior  under  Overload  Conditions 

Since  the  traffic  streams  which  are  traversing  a  single  multiplexer  can  be  of  highly  variable  rates,  it  is 
possible  -  for  a  short  period  of  time  -  to  have  an  average  arrival  higher  than  1.  If  this  condition  were  to 
persist,  then  the  buffers  would  grow  without  bound  and  the  system  would  be  unstable.  It  is  desirable 
to  study  the  behavior  of  the  policies  under  such  extreme  conditions.  The  case  in  which  the  scheduler 
queue  is  unstable  is  considered  next.  That  is,  p  >  1. 

Although  the  scheduler  queue  capacity  is  again  assumed  to  be  infinite  and,  for  that  matter,  no 
overflow  will  occur,  the  relationship  in  (12)  will  not  hold  since  traffic  will  accumulate  in  the  infinite 
queue.  This  ever  increasing  queue  built  up  will  represent  a  difference  between  input  and  output  traffic 
rate,  making  it  hard  to  assess  the  precise  throughput  achieved  by  the  tagged  or  background  streams. 

In  view  of  the  large  buffer  assumption  at  the  scheduler  it  is  evident  that  the  scheduler  queue 
occupancy  upon  tagged  cell  arrival,  Qk,  will  always  exceed  T  —  1  under  overload  conditions  (p  >  1).  As 
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a  consequence  —  as  it  will  become  evident  below  —  the  analysis  of  the  scheduler  tagged  cell  interdeparture 
process  is  simplified  significantly.  For  this  reason  both  policies  are  treated  concurrently. 

The  following  proposition  provides  for  the  precise  description  of  the  tagged  stream  interdeparture 
distribution  at  the  output  of  the  scheduler  under  overload  conditions. 

Proposition  6  .'  Under  the  overload  conditions  at  the  scheduler  queue  (p  >  \)  and  infinite  sched¬ 
uler  buffer  capacity,  the  distribution  of  the  tagged  cell  interdeparture  at  the  scheduler,  Xk,  under  the 
dynamic-R&S  policy,  is  given  by: 

T 

Pr{Xk  =  0  =  E  PrMm  =  1-1,  ^m_1  <T  —  2}  *  1  {T</<  N+T— 3}  + 

m=l 

+Pr{AT  =  l  —  1}  *  l{i<i<r-i)  /or  1  <  1  <  AT  +  T-3;  (18) 

the  distribution  under  the  static-R&S  policy  is  given  by  : 

Pr{X, t  =  1}  =  Pr{AT  =  1-1}  for  1<1<(N  —  1)*T  +  1  (19) 

Proposition  6  can  be  employed  in  deriving  the  throughput  of  the  tagged  application.  The  following 
proposition  establishes  the  better  jitter  and  smoothness  characteristics  of  the  dynamic-R&S  scheme, 
compared  to  those  of  the  static-R&S  scheme  under  overload  conditions  at  the  scheduler. 

Proposition  7  :  Under  overload  conditions  at  the  scheduler  the  dynamic-R&S  policy  will  poten¬ 

tially  reduce  the  tagged  cell  spreading  (while  it  will  never  increase  it)  compared  to  the  static- R&S  policy. 
That,  is, 

Pr{Xk  >  m}  <  Pr{Xk  >  m}  for  m>T. 

The  tagged  cell  clustering  will  be  identical  under  both  policies,  that  is, 

Pr{Xdk  <  m}  =  Pr{Xsk  <  m}  forl<m<T. 

and 

Pr{Xk  =T}>  Pr{Xsk  =  T } 

The  following  proposition  establishes  some  insightful  relationships  between  the  moments  of  inter¬ 
departure  and  the  background  traffic  process. 

Proposition  8  ;  Under  overload  conditions  at  the  scheduler  the  following  can  be  shown  : 

(a)  E{Xl}=T*E{Al}  +  l  =  T*{N-l)*pb  +  l 

(b)  VAR{Xsk}  =T*VAR{A1}  =  T*{N -l)*pb*{l-pb) 

(c)  E{Xf)  =  E{W}  *  E{A1}  +  1  =  E{W }  *{N-\)*pb+l 

Where  the  last  part  of  the  above  equations  is  derived  for  a  binomial  random  variable  Ax  with  maximum 
value  N  -  1  and  success  probability  pb;  V AR{x}  denotes  the  variance  of  random  variable  x. 
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Corollary  2  ;  The  throughput  gain  of  the  dynamioR&S  scheme  over  that  of  the  statioR&S 
scheme  under  overload  conditions  increases  as  the  load  increases  and  its  asymptotic  value  is  equal  to 
T .  That  is  : 

i  IHxj)  r  +  i/EM1}  B{A^ 

l/E{X°k}  E{W}  +  1/E{A'} 

6  Numerical  Results 

In  this  section,  some  numerical  results  are  presented  to  quantify  the  behavior  of  the  dynamic-R&S  and 
static-R&S  schemes.  The  results  are  derived  under  heavy  traffic  source  load  at  the  regulator  and  both 
underload  and  overload  conditions  at  the  scheduler.  Although  the  heavy  traffic  source  and  overload 
conditions  are  not  the  dominant  ones  in  a  well  designed  system,  they  will  be  present  if  substantial 
statistical  multiplexing  gain  is  to  be  achieved.  And  while  simple  regulator  schemes  -  such  as  the  static- 
R&S  one  -  may  be  adequate  under  nominal  (underload)  traffic  conditions,  it  is  important  that  their 
behavior  under  less  frequent  -  but  QoS  compromising  -  overload  conditions,  be  investigated.  The  results 
shown  here  indicate  that  the  dynamic-R&S  scheme  proposed  in  this  paper  outperforms  the  static-R&S 
and,  importantly,  can  cope  with  overload  conditions  better  than  the  static-R&S  one. 

The  results  presented  below  have  been  obtained  for  two  values  of  the  target  interdeparture  time 
T  -  or  desirable  throughput  1/T  -  equal  to  T  -  5  and  T  =  10.  The  background  traffic  is  modeled 
as  an  independent  per  slot,  batch  process  with  binomially  distributed  batch  size  of  maximum  value 
N  —  1  =  8  and  success  probability  pb-  The  background  load  or  utilization  is  denoted  by  Pback-  I*1  a 
system  were  all  -  or  a  large  subset  -  of  the  connections  are  regulated,  the  background  process  will  not  be 
independent  per  slot.  This  assumption  is  made  for  analysis  tractability,  since  modeling  the  correlation 
in  the  background  would  make  the  analysis  of  the  system  impossible.  Also  the  above  assumption  is 
not  completely  unrealistic,  since  in  each  slot  a  number  of  eligibility  times  will  mature  and  a  batch  of 
cells  will  be  delivered  to  the  scheduler. 

Fig.  G  presents  the  regulator  throughput  vs  Pback  under  both  policies,  obtained  from  Proposition  2. 
As  pback  increases,  the  dynamic-R&S  policy  can  detect  the  increased  background  intensity  and  release 
cells  earlier,  attempting  to  provide  the  targeted  tliroughput  (1/T)  and  control  jitter.  As  a  consequence, 
the  rate  by  which  the  packets  leave  the  regulator  increases  as  the  background  intensity  increases  and 
it  will  reach  a  maximum  of  1  if  at  least  T  -  1  background  cells  are  delivered  to  the  scheduler  in  each 
slot  (high  overload  at  the  scheduler).  The  scheduler  throughput  Smax  vs  pback  is  shown  in  Fig.  7  for 
both  policies  under  underload  conditions  (p  <  1).  Smax  is  calculated  as  1  jE{X^}\  the  Probability 
Mass  Function  (PMF)  of  Xk  is  calculated  from  (1G)  for  the  dynamic  policy,  and  from  (17)  for  the  static 
policy.  As  expected  from  (12),  S^nax  =  R-max  under  underload  conditions  and  infinite  buffer  capacity  at 
the  scheduler.  Although  R^ax  increases  to  1  as  Pback  increases  (as  said  earlier),  starts  deviating 

from  Ryuax  an(l  declines  beyond  some  value  of  pback  equal  to  about  .78  for  T  =  5.  This  is  due  to  the  fact 
that  the  scheduler  reaches  an  overload  state  (p  >  1)  and  the  dynamics  change.  Specifically,  an  infinite 
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scheduler  queue  backlog  is  built  up  under  these  conditions,  resulting  in  a  different  than  Rmax  output 
rate  from  the  scheduler,  as  discussed  earlier.  Thus,  only  the  results  for  Rmax  +  Pback  <  1  are  relevant. 
The  scheduler  study  under  overload  should  be  employed  for  the  derivation  of  results  for  p  >  1. 

Results  for  Smax  vs  Pback  are  shown  in  Fig.  8  for  Pback  such  that  the  scheduler  is  in  overload 
state  (p  >  1).  The  results  can  be  calculated  either  from  parts  (a)  and  (c)  of  Proposition  8,  or  from 
the  PMF  of  Xk  derived  in  (19)  and  (18)  The  improved  throughput  characteristics  of  the  dynamic- 
R&S  scheme  can  be  clearly  observed.  The  higher  than  the  targeted  throughput  for  low  overload 
conditions  is  in  accordance  with  expectations  based  on  the  increased  regulation  throughput  and  the 
heavy  traffic  assumption  for  the  tagged  source.  As  long  as  p  <  1,  the  regulator  throughput  determines 
the  throughput  at  the  scheduler  as  well,  as  explained  earlier.  When  p  >  1,  some  of  the  regulator  traffic 
is  “absorbed”  by  the  infinite  buffer  built  up  at  the  scheduler  and  a  throughput  reduction  is  observed 
for  this  reason.  Nevertheless,  by  reducing  W  (see  equation  (1),  the  dynamic-R&S  scheme  is  capable  of 
providing  the  targeted  throughput  under  severe  overload  conditions.  In  the  limiting  case  of  very  large 
overload,  W  — >  1  and  the  tagged  throughput  reduction  below  the  targeted  value  is  observed,  induced 
by  the  per  slot  background  batch  size.  It  should  be  noted  that  under  the  static-R&S  scheme  the  tagged 
throughput  falls  dramatically  even  under  low  overload  conditions.  This  reduction  is  directly  related  to 
the  cumulative  over  T  slots  background  arrivals,  as  opposed  to  that  over  W  slots  ( W  — >  1)  under  the 
dynamic-R&S  scheme. 

In  addition  to  maintaining  a  throughput  as  close  to  the  targeted  one  under  overload,  it  is  important 
that  the  variability  of  the  throughput  (or  interdeparture  time  X^)  be  low.  Fig.  9  presents  results  for  the 
variance  of  the  tagged  cell  interdeparture  process  induced  by  the  two  policies  vs  Pback  and  for  underload 
conditions  at  the  scheduler  (p  <  1).  Again,  only  the  results  for  Rmax  +  Pback  <  1  are  relevant.  It  is  clear 
that  the  dynamic-R&S  policy  provides  for  a  less  variable  interdeparture  process  than  the  static-R&S 
one. 

Similar  results  for  p  >  1  are  presented  in  Fig.  10.  These  results  have  been  derived  by  employing  the 
PMF  of  Xk  derived  in  Proposition  6,  for  various  values  of  pback  corresponding  to  success  probabilities 
given  by  —  0. 12  -P  ( A; — 1)*0.02  for  k  =  1,2,... 13.  In  view  of  the  linear  relationship  between  VAR(Xfc) 
and  VAR(Al)  under  the  static-R&S  scheme  (Proposition  8)  the  increasing  behavior  of  V AR{Xk)  as 
Pback  increases  is  expected  and  it  is  observed  in  Fig.  10.  The  results  under  the  dynamic-R&S  scheme 
are  more  difficult  to  interpret.  For  low  overload  conditions  V AR( X^)  decreases  until  pback  =  1*44  (4th 
point  on  the  plot)  and  then  increases  slightly.  X *  depends  solely  on  the  background  accumulation 
over  W  slots  (between  consecutive  tagged  cell  releases).  Therefore,  as  pback  increases  the  condition 
Avv  >  T  —  1  is  expected  to  be  met  in  fewer  slots,  and  therefore  a  cell  would  be  released  earlier.  This 
implies  that  a  decreasing  number  of  batches  would  interfere  with  X^,  reducing  VAR(Xfc).  As  pback. 
increases  the  number  of  batches  which  interfere  with  X \  under  the  dynamic-R&S  policy  reduces  to  1; 
beyond  that  point  the  increased  value  of  V  AR(Xjc)  is  due  to  the  increase  in  VAR(A1). 

The  detailed  traffic  smoothness  characteristics  of  the  two  R&S  schemes  are  shown  in  Fig.  11  -  16 
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for  underload  and  Fig.  17  -  20  for  overload  conditions  at  the  scheduler,  where  the  PMF  of  X k  (jitter) 
and  its  tails  are  plotted. 

The  jitter  ( X k)  PMF  under  both  policies  is  plotted  in  Fig.  11  for  p ^ack  =  .45  and  T  =  5  and 
T  —  10.  Since  p  «  1  the  two  policies  behave  similarly  and  the  tails  of  the  PMF  almost  coincide. 
For  higher  background  utilization  (pback  —  -85)  the  PMF  of  Xk  becomes  quite  distinct  for  the  two 
policies  (Fig.  12).  The  induced  clustering  ( X *  <  T)  and  spreading  (Xk  >  T)  are  better  shown  in 
terms  of  the  tails  of  the  PMF  presented  in  Fig.  13  (15)  and  14  (16),  respectively,  for  T  =  5  (T  =  10). 
Tagged  cell  clustering  (Xk  <  T)  is  seen  to  slightly  increase  under  the  dynamic-R&S  policy  while 
spreading  (Xk  >  T)  is  substantially  reduced  and  more  probability  mass  is  concentrated  around  T. 
While  the  spreading  reduction  under  the  dynamic-R&S  policy  is  expected,  the  slight  increase  in  the 
clustering  is  less  obvious.  It  may  be  attributed  to  the  higher  probability  that  the  scheduler  queue  is 
nonempty  under  the  dynamic- R&S  policy,  due  to  the  higher  scheduler  load  (5max)  resulting  from  a 
higher  regulator  throughput  (Rmnx)-  This  is  discussed  also  below. 

The  traffic  smoothness  characteristics  of  the  two  schemes  under  overload  traffic  conditions  at  the 
scheduler  can  be  observed  in  Fig.  17  and  18,  where  the  jitter  distribution  (or  scheduler  interdeparture 
distribution)  is  plotted  for  two  different  values  of  p^ack  (Back  Util).  In  both  cases  the  jitter  probability 
mass  function  is  highly  contained  around  the  target  value  T  under  the  dynamic-R&S  scheme,  and  it  is 
spread  over  wide  range  of  values  under  the  static-R&S  scheme. 

The  tails  of  the  probability  mass  function  representing  spreading  ( Pr{Xk  >  k}  for  k  >  T),  are 
shown  in  Fig.  19  and  20  for  T  =  5  and  T  =  10,  respectively.  The  good  jitter  characteristics  of  the 
dynamic-R&S  scheme  in  terms  of  reduced  cell  spreading  are  clearly  observed.  Excessive  spreading  - 
occurring  under  network  congestion  (scheduler  overload)  -  may  compromise  the  QoS  of  a  real-time 
application  by  causing  starvation  at  the  end  user.  It  is  evident  that  the  starvation  probability  can 
be  substantially  lower  under  the  dynamic-R&S  scheme.  As  indicated  in  Proposition  7  the  policies 
induce  identical  clustering  since  the  latter  depends  only  on  the  number  of  background  arrivals  over 
T  slots  under  overload  conditions.  Under  underload  conditions,  the  empty  scheduler  queue  process 
also  modulates  clustering.  Since  this  process  is  different  under  the  two  policies,  different  clustering  is 
expected  under  underload  conditions  at  the  scheduler  (shown  in  Fig.  15). 

7  Simulation  Results 

7.1  The  simulated  system 

In  the  preceding  sections  the  R&S  policies  were  considered  for  a  single  (tagged)  source.  All  other  co¬ 
existing  sources  were  aggregated  and  formed  the  background  source.  This  consideration  was  necessary 
in  order  to  analytically  study  the  behavior  of  the  system  and  gain  insight  on  the  operation  of  the  R&S 
policies. 

In  this  section,  a  system  in  which  more  than  one  sources  are  controlled  by  the  R&S  policies 
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is  considered  and  is  studied  by  employing  the  OPtimized  NEtwork  simulation  Tool  (OPNET).  The 
objective  here  is  to  investigate  the  behavior  of  the  two  policies  in  the  presence  of  real  background 
traffic,  as  generated  by  multiple  sources  controlled  by  these  policies.  The  numerical  results  presented 
in  this  section  are  consistent  with  the  expectations  shaped  by  the  analysis. 

A  system  with  N  =  7  ON_OFF  Markov  sources  was  simulated.  Such  sources  are  defined  in  terms 
of  the  parameters  of  the  geometric  distributions  governing  the  lengths  of  the  ON  (Pon)  and  OFF  ( P0ff ) 
periods  and  the  cell  generation  rate  Parr  when  the  source  is  in  the  active  (ON)  state  (no  cells  are 
generated  when  the  source  is  in  the  inactive  (OFF)  state).  The  empirical  PMF  and  the  variance  of  the 
interdeparture  process  and  the  scheduler  throughput  for  each  of  the  sources  were  measured.  The  actual 
cell  interdeparture  times  ( X k)  from  the  scheduler  were  recorded,  after  filtering  out  the  gaps  caused  by 
the  source’s  OFF  periods,  and  a  vector  X  =  [Xj  X2  . . .  Xk  . . .]  was  created  for  each  of  the  sources.  The 
empirical  interdeparture  PMF  for  each  source  was  obtained  as  the  histogram  of  the  samples  in  X. 
The  per  source  throughput  was  calculated  as  the  inverse  of  the  average  of  the  samples  in  X.  Since  X 
does  not  contain  the  OFF  periods,  the  calculated  throughput  will  be  higher  than  the  actual  average 
throughput.  This  calculated  throughput  is  called  the  active  throughput,  Sa.  This  filtering  of  the  data 
allows  for  capturing  the  effect  that  the  policies  have  on  the  sources  while  they  are  active,  without  being 
obscured  by  the  OFF  periods  where  the  policies  are  ineffective.  Sa  can  be  viewed  as  the  average  rate 
of  service  provided  to  a  source  while  it  is  in  the  ON  state.  The  variance  of  the  inter  departure  time 
is  expressed  in  terms  of  a  measure  of  its  deviation  from  the  target  spacing  Xm,n>i  and  is  calculated  as 
T^T  ^k=\{Xk  -  Xmjn>i)2,  for  source  i  and  sample  size  of  L. 

7.2  Results 

A  system  with  N  =  7  Markov  sources  was  considered  in  which  more  than  one  sources  are  controlled 
by  the  R&S  policies.  The  parameters  and  results  of  the  first  simulation  are  shown  in  Table  1;  Pon  = 
Poff  =  -8  for  srco,srci,src2,  Pon  =  -8  and  P0ff  =  .85  for  src3,src4  and  Pon  =  .8  and  P0fj  =  .75  for 
src^.srca.  For  each  source,  the  value  of  Xm,„  and  the  measured  average  source  rate  Xave  are  shown, 
along  with  the  measured  active  throughput  and  variance  VAR[X £]  under  the  dynamic- R&S  and 
Sq  and  variance  V  AR[X%\  under  the  static-R&S  policies. 

The  Statistical  Multiplexing  Gain  (SMG)  -  defined  as  Yi  1/Xmin,i  ~  is  equal  to  3.6  in  this  case. 
The  system  utilization,  p,  is  equal  to  0.9505  (p  =  YiKve,i),  which  should  be  less  than  1  for  a  stable 
system.  The  sources  with  Xm,n  =  1  are  practically  unregulated  since  they  are  allowed  to  release 
their  cells  to  the  scheduler  as  soon  as  they  are  generated.  Such  sources  could  be  ones  without  jitter 
constraints.  Sources  with  Xmin  >  1  are  the  ones  targeted  for  regulation;  for  these  sources,  Xmin  is  set 
to  [1/ParrJ. 

The  positive  impact  of  the  dynamic-R&S  policy  on  the  regulated  sources  (src3,  src4,  src$,  srce)  is 
clearly  observed  in  Table  1:  the  variance  of  the  interdeparture  process  under  the  dynamic- R&S  scheme 
(FA7?[X^j)  is  substantially  smaller  than  the  variance  under  the  static-R&S  scheme  (VA7?[X|]).  This 
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src  # 

Xmin 

A ave 

Si 

VAR[Xdk) 

VAR[Xil 

stcq 

i 

0.2470 

0.44860 

3.2604 

src\ 

i 

0.1952 

0.39611 

4.8649 

src2 

i 

0.2004 

0.40216 

Emm 

4.6249 

4.3924 

srcs 

10 

0.0434 

0.10768 

0.09611 

4.6240 

9.3360 

src4 

10 

0.0422 

0.10568 

0.09444^ 

4.0288 

8.8864 

src$ 

0.1105 

0.20280 

0.18248 

3.9422 

5.1855 

src<> 

5 

0.1118 

0.20731 

0.18620 

3.3056 

4.5055 

Ei  1  /XminA  =  3.6 

E.  A ave,i  =  0.9505 

Table  1:  Simulation  parameters  for  the  1st  simulation  scenario 


quantifies  the  ability  of  the  dynamic-R&S  scheme  to  control  the  delay  jitter  better  than  the  static-R&S 
scheme,  for  all  of  the  regulated  sources.  As  a  result,  under  the  dynamic-R&S  scheme,  Sa  reaches  the 
target  value  ( 1/Xmin )  and  exceeds  it  slightly,  due  to  some  residual  clustering  which  is  present  under 
both  policies.  On  the  other  hand,  under  the  static-R&S  scheme  Ssa  is  lower  than  the  target  value  due 
to  the  static  nature  of  the  policy.  The  dynamic-R&S  scheme  manages  to  serve  the  regulated  sources 
with  a  less  variable  service  rate,  which  reaches  the  target  peak  service  rate.  As  a  result,  the  traffic  is 
better  shaped  at  the  output  of  the  scheduler,  producing  the  desired  performance. 

The  unregulated  sources  (srco,  src\,  srco)  experience  a  slightly  higher  VAP.[Xjf].  Since  the  target 
Xmin  for  these  sources  is  1,  the  two  policies  are  basically  ineffective  and  the  variance  and  active 
throughput  are  shaped  solely  by  the  background  interfering  process  seen  by  each  of  these  sources  during 
each  slot.  The  background  interfering  processes  are  different  under  the  two  policies.  It  is  expected 
that  under  the  dynamic-R&S  policy  more  cells  will  be  released  to  the  scheduler  per  slot  during  the  ON 
period,  due  to  the  early  releases.  Therefore,  more  background  traffic  interferes  with  the  unregulated 
sources  under  the  dynamic-R&S  scheme  resulting  in  more  spreading  which  lowers  S%. 

The  above  results  can  be  viewed  graphically  in  terms  of  the  empirical  PMFs  shown  in  Fig.  21 
and  Fig.  22.  The  probability  mass  at  the  target  value  is  substantially  higher  under  the  dynamic-R&S 
scheme.  This  may  be  attributed  to  the  reduced  spreading.  Every  time  a  cell  is  released  earlier  than 
the  release  time  under  the  static-R&S  policy,  potential  spreading  is  avoided.  The  empirical  PMFs 
for  Xk  <  Xmin  almost  coincide  under  the  two  policies,  implying  that  the  two  policies  generate  the 
same  amount  of  clustering.  The  empirical  PMFs  for  src0,srcusrc2  are  almost  identical  under  the  two 
policies  and  they  are  not  shown  since  they  do  not  provide  any  insight. 

In  order  to  study  the  system  under  extreme  overload,  the  average  rate  of  the  three  unregulated 
sources  was  increased  (Table  2);  the  values  of  Pon  and  P0ff  &re  fhe  same  as  in  the  first  simulation. 
This  could  reflect  a  scenario  according  to  which  the  unregulated  sources  start  to  misbehave  and  become 
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src  # 

X-min 

A ave 

cd 

Si 

VAR[X(] 

VAR[X'k) 

srco 

l 

0.39766 

0.43032 

0.43507 

3.1226 

2.9486 

src\ 

l 

0.38366 

0.42681 

0.43153 

3.1327 

2.9804 

src2 

l 

0.40030 

0.43270 

0.43707 

3.0266 

2.9132 

src3 

10 

0.04330 

0.09293 

0.06548 

8.2622 

48.0283 

src4 

10 

0.04230 

0.09089 

0.06389 

8.9868 

51.9631 

srcs 

5 

0.11112 

0.15797 

0.12405 

11.995 

20.3197 

srcft 

5 

0.11180 

0.16287 

0.12627 

9.8256 

17.7105 

y^j  =  3.6 

£i  W  =  1-4905 

Table  2:  Simulation  parameters  for  the  2nd  simulation  scenario 


bandwidth  greedy.  The  utilization  of  the  system  was  brought  up  to  1.4905  for  some  time  and  then  the 
sources  were  turned  off  and  the  scheduler  was  served  until  it  became  empty.  This  way,  the  behavior  of 
the  two  policies  under  temporary  overload  conditions  could  be  studied. 

Table  2  shows  results  for  the  variance  of  the  interdeparture  process  and  active  throughput  under 
this  scenario.  The  variance  of  the  regulated  sources  under  the  static-R&S  policy  increases  dramatically. 
This  is  not  the  case  under  the  dynamic-R&S  policy  which  manages  to  keep  the  variance  substantially 
lower.  Even  though  under  extreme  overload  the  analytical  results  predict  that  £d  should  be  fairly 
close  to  the  target  value  l/Xmin  (Fig.  8),  it  is  not  seen  here.  The  latter  is  due  to  the  heavy  traffic 
assumption  made  in  the  analytical  study,  under  which  the  regulator  never  empties  and  the  maximum 
effect  of  the  policy  can  be  revealed.  In  the  simulations,  the  regulator  can  be  empty  when  the  conditions 
for  a  cell  release  are  met.  The  eligible  cell  will  be  released  in  a  later  slot  as  soon  as  it  arrives,  allowing 
the  misbehaving  sources  to  secure  more  bandwidth.  The  empirical  PMFs  show  a  dramatic  spreading 
of  the  interdeparture  time  under  the  static  R&S  scheme.  The  dynamic-R&S  scheme  manages  to  highly 
contain  the  PMF  around  the  target  value,  which  is  the  desired  behavior.  Due  to  space  limitation  these 
plots  are  not  presented. 

In  order  to  study  the  system  under  statistical  multiplexing  gain  of  less  than  1,  the  simulation 
with  the  parameters  shown  in  Table  3  was  considered;  SMG  =  0.93  and  p  =  0.7287;  Pon  —  .9  and 
PQff  =  .2  for  srco,  src\,  src2,  src$,  Pon  =  .9  and  P0ff  =  .4  for  src4,src$  and  Pon  =  .8  and  P0jj  =  .75 
for  src§.  The  results  show  that  the  dynamic- R&S  policy  is  now  outperformed  by  its  static  counterpart. 
V AR[Xf\  and  Sd  are  both  higher.  Under  such  circumstances  a  higher  cannot  be  considered  to  be 
an  improvement,  since  it  is  due  to  more  clustering.  Since  SMG  <  1,  there  are  no  periods  of  overload, 
but  the  conditions  for  early  release  can  be  met  for  some  of  the  sources.  As  cells  are  released  earlier  the 
scheduler  occupancy  under  the  dynamic-R&S  scheme  is  expected  to  be  higher,  and  two  consecutive  cells 
can  be  clustered.  Under  the  static-R&S  scheme  the  scheduler  occupancy  is  lower  (below  7)  resulting 
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src  # 

Xmin 

A ave 

Sda 

si 

VAR[Xi) 

VAR[X’t] 

stcq 

10 

0.09970 

0.09979 

0.09914 

1.7742 

1.5668 

| 

10 

0.09949 

0.09949 

0.09899 

2.3031 

1.8366 

1 

10 

0.09911 

0.09911 

0.09845 

3.3517 

3.0075 

src3 

10 

0.09987 

0.09987 

0.09883 

2.4080 

2.1334 

src4 

6 

0.16314 

0.16314 

0.15813 

4.8881 

3.7537 

src$ 

6 

0.16377 

0.16377 

0.15878 

4.0384 

2.9886 

srcG 

5 

0.20161 

0.20161 

0.19424 

2.6167 

1.6143 

]C?  =  0.93 

EiA„ve!l- =  0.7287 

Table  3:  Simulation  parameters  for  the  3rd  simulation  scenario 


in  lower  clustering. 

In  order  to  run  this  simulation  with  SMG  —  0.93  and  p  =  0.74  the  arrival  rates  of  the  sources 
were  increased  by  reducing  the  length  of  the  OFF  and  increasing  the  length  of  the  ON  periods.  This 
makes  the  sources  less  bursty  and  it  makes  peak  rate  allocation  of  connections  cost  efficient.  In  a 
highly  utilized  network  this  scenario  is  not  expected  to  be  the  dominant  one.  Instead  the  previous  two 
are  expected  to  be  the  dominant  ones,  under  which  the  dynamic-R&S  scheme  clearly  outperforms  the 
static-R&S  scheme. 


A  Proofs  of  Propositions  and  Corollaries 

This  appendix  contains  al  the  proofs  for  the  propositions  and  corollaries  presented  in  the  paper. 

A.l 

Proof  of  Proposition  1 

Proof:  Equation  (8)  is  obtained  by  noting  that  the  events  {W  >  j}  and  <  T  —  2} 

are  equivalent  for  1  <3  <T,  writing 


Pr{W  =  j) 


=  Fj-1(T-2)-Fj(T-2) 


and  noting  that 

pr{W  =  1}  =  Pr{Al  >  T  -  1}  = 

=  1  -  Pr{Al  <  T  -  2}  =  F0(T  —  2)  —  F\{T  —  2)  (20) 
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and 


T— 1 

Pr{VF  =  T}  =  Pr {  ^  A  <  T  -  2}  =  FT-i  (T  -  2)  -  Fr(T  -  2)  (21) 

Z  — 1 

Equation  (9)  can  be  shown  by  considering  the  indicator  function  Ij  defined  as  follows.  Let 


h 


1  if  the  previous  (j  —  1)  batches  are  such  that  i  -Aj  <  T  —  2 
0  otherwise 


for  1  <  j  <  T,  and  let  I\  =  1.  Notice  that  Ij  =  1  implies  that  W  >  j  and,  thus, 


(22) 


T 

W  =  Y,h  (23) 

j= i 

Application  of  the  expectation  operator  to  the  above  leads  to  the  following. 

E{W}  =  E{J2  Ij)  =  Y,  EiTi}  =  E  Pr(L  Ai<T-2}  +  l  =  Y  Fj- i(T  -  2)  □  (24) 

j=l  j= 1  j- 2  t=l  j= 1 

A. 2 


Proof  of  Proposition  2 

Proof:  Equations  (10)  and  (11)  are  self-evident  in  view  of  the  heavy  traffic  source  assumption 

which  is  typically  made  to  establish  the  maximum  throughput  and  in  view  of  (9).  The  inequality 
can  be  proven  by  noting  that  W  =  min{B ,T}  <  T  with  equality  only  if  the  stated  condition  is 
met.  Alternatively,  notice  that  T  =  Yf=  \  1  ^  J2j=i  Fj-i{T  —  2)  =  E{W}  since  Fj-\(T  —  2)  <  1; 
equality  holds  if  Fj-\(T  —  2)  =  1  or  Pr{Y^iZ\  Ai  >  T  —  1}  =  0  for  1  <  j  <  T  or,  equivalently,  if 
Pr{YY=\  A,  >  T  —  1}  =  0  which  is  the  stated  condition.  □ 


A. 3 

Proof  of  Proposition  3 

Proof:  The  scheduler  interdeparture  process  X*  can  be  determined  by  the  scheduler  occupancy 

Qk  found  upon  arrival  of  the  kth  tagged  cell  (not  counting  itself).  Since  the  next  arrival  to  the  scheduler 
will  occur  after  Wk  =  min{Bk,T}  slots,  the  scheduler  can  serve  at  most  T  cells  until  the  next  tagged 
cell  arrival  occurs.  Two  cases  need  to  be  distinguished,  depending  on  whether  Q ^  >  T— 1  or  Qk  <T— 2. 
Case  I:  Qk  >  T  -  1 

In  this  case  Xk  is  equal  to  the  number  of  background  cells  which  arrived  over  Wk  plus  1  slots  (see 
Fig  4). 

Case  1.1:  0  <  l  <  N  —  2  (no-clustering) 

Let  m,  1  <  m  <  T,  be  the  slot  over  which  the  (k  +  l)st  tagged  cell  enters  the  scheduler  queue, 
following  the  arrival  of  the  ktfl  tagged  cell,  that  is,  Wk  =  m.  Clearly,  Am  =  Xk  —  1  =  T  +  l  —  1  and 
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Figure  4:  Illustration  of  X k  =  Am  4-  1  when  Wk  =  m  and  Qk>T-  1 


Am~l  <  T  —  2.  The  smallest  possible  value  of  X k  not  inducing  clustering  is  Xk  —  T.  On  the  other 
hand,  the  largest  possible  value  of  Xk  allowed  by  the  dynamic-RfcS  policy  is  X^ .  =  T  4-  N  —  2,  since 
it  is  possible  that  T  —  2  background  cells  have  been  accumulated  by  the  (m  —  l)st  slot  and  a  batch 
of  maximum  size  (equal  to  TV  —  1)  occurs  during  slot  m.  As  a  result,  T  <Xk<T  —  2  +  N  —  1  +  1 
implying  that  0  <  l  <  N  —  2.  In  this  case, 

T 

Pr{Xk  =  T  +  l/Qk  >  T  -  1}  =  ]T  Pr{Am  =T  +  l-  1,  Am_1  <  T  -  2} 

m=l 

Case  1.2:  —  (T  -  1)  <  /  <  0  (clustering) 

Clustering  can  happen  only  if  the  background  arrivals  over  T  slots  do  not  exceed  T  —  2.  At  slot 
T  a  cell  will  be  released  to  the  scheduler  (14  =  T),  creating  clustering.  Since  Qk  >T  —  1  it  is  implied 
that  Xk  =  AT  4-  1  and,  thus, 

Pr{X*  —  T  4-  /}  —  Pr{AT  =  T  +  l  -  1}. 

Case  IT:  Qk  =  i<T -2 

Since  Q*  =  i  <  T  —  2,  it  is  possible  that  the  server  will  exhaust  the  Qk  + 1  cells  before  the  (k  +  l)st 
tagged  cell  arrives  and  serve  some  of  the  background  built  up  behind  the  kth  cell  (see  Fig.  5).  As  in 
Case  I,  the  following  cases  are  considered. 

Case  II. 1:  l  >  0  (no-clustering) 

Let  m  be  the  slot  over  which  the  (k  4-  l)st  tagged  cell  enters  the  scheduler,  following  the  arrival  of 
the  kth  tagged  cell  (that  is,  Wk  =  m). 

Case  Il.l-(i)  1  <  m  <  i  +  1 

In  this  case  the  server  has  not  had  a  chance  to  serve  any  of  the  background  cells  built  up  behind  the 
kth  tagged  cell.  This  case  is  then  equivalent  to  Case  1.1  and  therefore, 

2  +  1 

Pr{Xk  =  T  +  l/Qk  =i}=  Pr{Am  =  T  +  l-  1,  Am~l  <T -2}. 

m—  1 
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Figure  5:  V*.  =  j,  Qk  <T  -  2:  Scheduler  can  serve  up  to  j  cells  form  the  A1+l 


Case  IT.l-(ii)  i  +  2  <m  <T 

Tn  this  case  the  server  has  served  the  kth  tagged  cell,  as  well  as  some  of  the  background  built  up  behind 
the  kth  tagged  cell,  before  slot  m.  Xk  will  be  determined  by  the  scheduler  occupancy  at  slot  m.  Since 
time  slot  m  depends  on  the  background  accumulation  behind  the  fcth  tagged  cell,  and  this  quantity 
cannot  be  determined  by  the  occupancy  process,  both  the  background  accumulation  process  behind 
the  kth  tagged  cell  and  the  queue  occupancy  process  should  be  considered  jointly  after  time  i  + 1.  (see 
Fig.  5). 

Let,  A{j)  denote  the  cumulative  background  arrivals  between  time  slot  j  =  0  (which  is  slot  i  +  1) 
and  (including)  slot  j  =  T  -  i  —  1,  following  slot  i  +  1.  Let  Q(j)  denote  the  scheduler  queue  occupancy 
at  the  jth  slot,  including  arrivals  during  the  jth  slot.  The  following  lemma  describes  the  evolution  of 
process  {A{j),  Q{j)}J=£~l- 

Lemma  1  :  The  evolution  of  process  {A{j),Q{j)}Jlo~l  is  given  by 

A(j)  =  A{j-l)  +  Ax  (25) 

Q{j)  =  max{Q{j  -  1)  -  1,0}  4-  A1  (26) 

and,  thus,  the  process  evolves  as  a  Markov  chain.  The  initial  state  of  this  process  will  be  taken  to  be 
given  by  (J4(0),Q(0)}  =  {0,  n},  where  n  is  the  queue  occupancy  at  slot  i  +  1.  The  1-step  transition 
probabilities  are  given  by: 

Pr{A(j)  =  i2,  Q{j)  =  hi  Mi  -  !)  =  *ii  Q(i  -  !)  =  h)  = 

{Pr{Al  =  i2  -  *i}  if  h  =  rnax{ji  -  1,0}  +  h  -  n 
0  otherwise. 

Since  the  evolution  of  this  process  will  be  terminated  when  the  cumulative  background  arrivals  n  +  A{j) 
exceed  T  -  2,  the  k- step- constrained  (by  n  +  A(j)  <T  —  2)  evolution  of  {A{j),Q{j)}3jZ l~'~l  will  be 
considered  only.  The  corresponding  k-step-constrained  transition  probabilities  are  given  by  the  following 
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recursions,  where 

Prk{A(j)  =  of,  Q{j)  =  Qf/Mj  ~k)  =  a/,  Q{j  -  k)  =  qj}  =  Prk{aF ,  gf/o/,9/}, 
Prfc{aF,gF/a/,9/}  = 

T-2-q, 

=  Y  Pr{ai,qi/ai,qi}  *  Prk~l{aF,qFlai,q\}  - 

Q)  =0 

where  q\  =  max{qi  —  1,0}  +  oi  —  a/ 

T- 2-9/  T—2—qi 

-  E  E  Pr{ai,gx/a/,9/}  *  Pr{a2, 92/01, 9i}  *  Prk  2{of,  92}  =  *  *  * 

ai=0  a2=ai 

where  92  =  max}?;  —  1, 0}  +  02  —  01  □ 


As  it  can  be  seen  in  Fig.  5,  Q{j)  =  T  +  l  -  j  -  1  in  order  to  achieve  a  target  Xk  =  T  +  1,  given 
that  the  (k  +  l)st  cell  arrives  at  the  scheduler  during  the  jth  slot,  l<j<T-i-2.  At  the  same 
time  A{j)  >  T  -  1  -  n,  since  A{j)  is  equal  to  the  background  accumulation  after  the  (i  4-  l)st  slot 
and  at  slot  i  +  1  there  were  Q(0)  =  n  =  Ai+1  background  cells  behind  the  kth  tagged  cell.  Given 
(A(0)  =  0,  Q(0)  =  n}  the  event  that  will  provide  for  the  target  Xk  —  T  +  l  is  {A(j)  >  T- 1  —  n,  Q(j)  = 
T  +  l  —  j  —  1}.  This  event  can  happen  any  time  between  1  <  j  <  T  —  i  —  2;  its  probability  is  given  by 
the  j-step-constrained  transition  probability  of  the  two  dimensional  Markov  chain  {A(j),Q(j)}J~, 01-1 
which  was  derived  earlier.  The  slot  j  =  T  —  i  -  1  is  considered  differently,  since  at  j  =  T  -  i  -  1  or 
m  =  T  a  cell  will  be  released  independently  of  the  state  of  A(T  —  i  —  1).  In  this  case,  the  event  of 
achieving  the  desired  Xk  =  T  +  /  is  given  as  {A(T  —  i  —  1)  >  0,  Q(T  —  i  —  1)  =  i  +  1}  (again,  the 
constrained  evolution  of  {A(j),  1  1  is  considered. 

In  view  of  the  above, 

T-2 

Pr{Xk  =T  +  l/Qk  =  i}  =  Y2  Pr{Ai+l  =  n, Xk  =T  +  l} 

n= 0 

T-2 

=  Y2  Pr{Xk  =  T  +  l/Ai+l  =  n}  *  Pr{AH1  =  n} 

Tl~0 

T-2 

=  Y,  Prixk  =  T  +  l/Mo)  =  0,  Q(0)  =  n}  *  Pr{Ai+l  =  n} 

n—0 

T— 2  T— i— 2  T-3+N-n 

=  EZ  E  Prj{Q(j)=T  +  l-j-l,A(j)  =  aj/A(0)  =  0,Q(0)=n}* 

n=0  j= 1  a;=T—  1— n 

*  Pr{AI+l  =  n}  + 

T-2T  —3+N—n 

+  E  E  P^-'^iQU)  =  i  +  =  aj/A{0)  =  Q,Q{0)  =  n}*  Pr{Al+1  =n} 

71=0  Qjf  =0 

Case  II.2:  l  <  0  (clustering) 

Clustering  can  happen  only  if  the  background  accumulation  over  T  slots  is  less  than  T  —  1  cells. 
This  can  be  seen  in  Fig.  5,  by  setting  j  =  T  —  i  —  1,  which  gives  Q(T  —  i  —  1)  =  i  +  l,  and  since 
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I  <  .0,  this  creates  clustering.  By  employing  similar  argument  as  in  Case  II.  1  it  can  be  shown  that 
given  (A(0)  =  0,  Q(0)  =  n},  the  event  that  will  provide  for  the  target  Xk  =  T  +  l  is  {A(T  —  i  —  1)  < 
T-2-n,  Q(T  -  t  -  1)  =  t  +  /}.  Thus, 

Pr{Xk  =T  +  l/Qk  =  i}  =  Tf  Pr{A^x  =n,Xk=T  +  l} 

n— 0 

T —2  T —2—n 

=  £  £  PrT~i~l{A{T-i-l)  =  a,Q{T-i-l)=i  +  l/A{0)  =  0,Q{0)=n}* 

n= 0  q=0 

*Pr{A'+x  =  n} 

Putting  all  the  cases  together  employing  indicator  functions,  equations  (13)  and  (14)  are  obtained. 

□ 


A. 4 


Proof  of  Proposition  4 

Proof:  Let  Qk  be  the  scheduler  queue  occupancy  upon  arrival  of  the  kth  tagged  cell  (without 

counting  itself)  and  let  m  denote  the  slot  over  which  the  ( k  +  l)st  tagged  cell  arrives.  The  following 
cases  need  to  be  considered. 

Case  I:  Qk  =  i  >  T  —  1 

It  is  easy  to  establish  that  {Q*}fc>i  for  Qk  =  i  >  T  —  1  evolves  as  a  Markov  chain  according  to 
the  equations: 


Qk  +  1  —  m  4-  Am  for  1  <  m  <  T  —  1 
I  if  T  —  1  <  Am  <  T  -  3  +  N,  Am~x  <  T  -  2 

+  Qk  +  1  —  T  +  At  for  m  =  T 

k  if  AT~X  <T-  2 

and  the  transition  probabilities  are  easily  determined  as  follows: 
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Pr{Qk+ 1  =  j/Qk  =  *'}  = 

=  Pr{i  +  1  -  m  +  Am  =  j,T  -  1  <  Am  <  T  -  3  +  N,  4m_1  <T-2,l<m<T-l}  + 

+  Pr{i  +  l-  T  +  AT  =  j,AT<T-3  +  N,  AT~X  <  T  -  2}  = 

=  Pr{Am  =  j  —  i  —  l  +  m,T  —  1  <  Am  <T  -3  +  N,Am~l  <T-2,l<m<T-l}  + 

+Pr{AT  =  j  —  i  —  l  +  T,AT<T  —  3  +  N,  ^T_1  <  T  -  2}  = 

—  Pr{Al  =  j  -i}  *  l{T-i<j-i<N-i}n{T-\<N-\}  + 

T- 1 

+  ^2  P,r{AnL  =  j  —  i  +  m  —  1 ,  Am~l  <  T  -  2}  *  l{r-m<ij-i<T-2+iv-m}n{r-i<m*(Af-i)}  + 

m—2 

+Pr{AT  =  j  -  i  —  1  +T,At  1  <  T  -  2}  *  l{_(r_1)<J_i<Ar_2} 


A  typical  evolution  of  the  scheduler  queue  when  Q *  <  T  —  2  is  shown  in  Fig.  5.  A  tagged  cell  can 
arrive  either  during  the  first  i  +  1  slots,  or  during  the  interval  [i  4-  2 ,  T].  In  the  former  case  the  queue 
evolution  is  described  similarly  to  the  case  described  above  for  Qk  >  T  —  1.  In  the  later  case,  the  queue 
evolution  is  described  by  employing  the  Markov  chain  defined  in  Lemma  1.  The  system  equations  are 
given  by  : 

Qk  +  1  —  m  4-  Am  for  1  <  m  <  i  +  1 

if  T  -  1  <  Am  <  T  -  3  +  N  and  Am~l  <  T  -  2 
Q(n)  for2  +  2<m<T-l  and  \ <n  <T - i -2 

if  A(n)  >  T  -  1  -  Ai+l  and  i(n  -  1)  <  T  -  2  -  Ai+l  ' 

Q(T  —  2  —  1)  for  m  =  T 

if  A(T  —  i  —  2)  <  T  —  1  —  Ai+l 

It  is  easy  to  establish  that  {Qk}k> l  for  Qk  =  i  <  T  -  1  evolves  as  a  Markov  chain.  Its  transition 
probabilities  are  easily  as  follows: 

Transition  Probabilities 

For  1  <  l_  <  i  +  1  the  transition  probabilities  can  be  obtained  by  the  same  reasoning  as  in  Proposition 
3  : 

Pr{Qk+\  =  j/Qk  =  *}  = 

Pr{Al  -  j  -i}*  l{r-i<j-i</V-i}n{T-KA'-i)  + 

l+l 

+  Pr{Am  =  j  —  i  -  1  +  771,  Am~X  <T  —  2}  *  l{T-m<j-t<T-2+N-m}n{r-l<m*(N-l)} 
m= 2 

Note  the  upper  bound  on  the  index  of  summation. 

For  i  ±  2  <  /  <  T  -  1  the  Pr{Qk+ 1  =  j/Qk  =  i}  when  Al+l  =  n  (see  Fig.  5),  can  expressed  in  terms 
of  (.4(n).  Q(rt)}  as  follows  : 

Pr{Qk+x=j/Qk=i}  = 

Pri{A(j)  =  aj,  Q(j)  =  j/A(  0)  =  0,  Q(  0)  =  n}  *  Pr{At+l  =  n,Ai  <T  -2} 

for  0  <  n  <  T  —  2 

and  \  <  j  <T  —  i  —  2 

and  T-l-n<aj<T-3  +  N-n 

The  above  gets  translated  into  the  triple  summation  of  15. 

For  1=T 

Pr{Qk+ 1  =  j/Qk  =  *'}  = 

=  PrT~i~l{A(T  —  i  —  1)  =  o,  Q{T  —  i  —  1)  =  j/A(0)  =  0,  Q(0)  =  n}  * 

*Pr{Ai+1  =n,A*'  <  T  -  2} 

for  0  <  n  <  T  —  2 

and  0<a<T  —  3  +  N  —  n 
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The  above  gets  translated  into  the  double  summation  of  15.  □ 


A. 5 

Proof  of  Proposition  5 

Proof  :  Let  tk  (t*,)  denote  the  time  of  the  fcth  tagged  cell  arrival  (departure)  to  (from)  the 

scheduler  queue.  Then: 

k  k  Q k  1  =>  Xk  =  7>+i  —  Tk=T  +  Qk+\  ~  Qk 

Tfc+1  =  tk+l  +  Qk+\  J 

Thus, 

CO 

Pr{Xk  =T  +  l}  =  Pr{Qk+l  -  Qk  =  1}  =  £  Pr{Qk+i  =  l  +  i/Qk  =  t}  *  nq{i)  □ 

t=i 

A. 6 

Proof  of  Proposition  6 

Proof:  Under  the  infinite  scheduler  buffer  and  overload  assumption  it  is  evident  that  the 

scheduler  will  never  be  idle  during  two  consecutive  tagged  cell  interarrivals  and,  thus,  the  distribution 
of  Xk  will  be  given  by  the  T-fold  convolution  of  A 1  under  the  static- R&S  scheme.  Therefore: 

Pr{Xk  =l}  =  Pr{AT  =  1-1} 

and  since 

0  <  At<T*(N-1)=> 

1  <  l  <  T  *  (N  —  1)  +  1 

Similarly  i f  W k  =  m  is  the  time  interval  between  two  consecutive  tagged  cell  releases  under  the  dynamic- 
R&S  scheme  then  the  condition  {Am  =1  —  1,  Am  <T  —  2}  must  be  met  in  order  for  the  interdeparture 
to  be  equal  to  l,  for  l  >  T,  where  1  <m  <T.  It  should  be  noted  that  the  result  of  this  proposition 
can  be  obtained  directly  from  (13)  and  (14)  by  noting  that  all  the  indicator  functions  are  zero  since 
Qk  >  T  —  1.  Therefore  : 

T 

Pr{  Xk  =  T  +  l/Qk  >  T  -  1}  =  Y,  Pr{Am  =T  +  l-  l,Am~l  <T  -  2}  for  /  >  0 

m=  1 

Pr{Xk  =  T  +  l/Qk  >  T  -  1}  =  Pr{AT  =  T  +  l-  1}  for  /  <  0 

and 

Pr{Xk  =T  +  l}  = 

Pr{Xk  =T  +  l/Qk  <  T  -  2}  *  Pr{Qk  <  T  -  2}  + 

+Pr{Xk  =  T  +  l/Qk  >T  -  1}  *  Pr{Qk  >  T  -  1} 

=  Pr{Xk  =  T  +  l/Qk>T-  1} 
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Changing  T  +  l  into  l  and  with  the  help  of  indicator  functions,  the  expression  in  (18)  is  obtained. 

Also  since: 

0  <  Am  <  T  —  3  +  N  => 

1  <  /  <N+T-3  □ 

A. 7 

Proof  op  Proposition  7 

Proof:  Let  Qt  denote  the  scheduler  queue  occupancy  upon  arrival  of  the  fcth  tagged  cell  to 

the  queue.  Under  the  overload  conditions,  Qk  >  T  —  1,  and  thus  the  kth  cell  will  not  depart  before  the 
(k  +  l)st  tagged  cell  arrives  (recall  the  heavy  traffic  assumption  is  made  in  this  paper).  As  a  result,  X* 
will  be  equal  to  the  background  arrivals  over  the  tagged  cell  interarrival  interval  and,  thus,  X£0  =  A *  -1 
and  X[w  =  AT  —  1,  where  W  is  the  generic  random  variable  for  Wf.  and  Vv\  =  min{B)c,T}  <  T.  Since 
Pr{A™  >  m}  <  Pr{Aj  >  m}  for  all  m,  the  first  claim  is  proved. 

Clustering  (A'*  <  T)  occurs  if  and  only  if  the  arrivals  of  the  background  traffic  following  the  arrival 
of  the  kth  tagged  cell  to  the  scheduler,  and  before  the  (k  +  l)st  tagged  cell  release,  are  less  than  T  -  1. 
Since  under  such  condition  the  two  policies  operate  identically,  it  is  clear  that  such  events  will  be 
induced  by  the  two  policies  with  the  same  probability. 

The  third  claim  follows  directly  from  the  expressions  in  Proposition  C  by  setting  l  =  T.  □ 


A. 8 

Proof  of  Proposition  8 

Proof:  Parts  (a)  and  (b)  are  self  evident  in  view  of  the  fact  that  under  overload  A£°  = 

A,1  +  1  and  A]  for  1  <  i  <  T,  are  independent  and  identically  distributed  random  variables. 
Part  (c)  is  also  self  evident  in  view  of  the  fact  that  W  is  a  stopping  time  for  the  random  variables  A], 
i  =  1,2,...  and  the  result  follows  from  Wald’s  equation  (pp.  171  in  [12]).  □ 

The  following  interesting  conclusions  may  be  drawn  from  Proposition  8.  This  result  establishes 
the  magnitude  of  the  difference  in  throughput  under  severe  overload  conditions  achieved  by  the  tagged 
traffic  stream  under  the  two  policies. 

A. 9 

Proof  of  corollary  2 

Proof  of  corollary  2 

Proof:  Under  very  heavy  traffic,  E{A1}  —>  oo,  the  dynamic- R&S  policy  will  release  one  tagged 

cell  per  slot  (that  is  E{W}  — >  1)  and  the  above  follows.  □ 
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Regulator  Throughput 


Figure  6:  Regulator  Throughput  VS  background  utilization  pback  ( Pback  < 


Scheduler  Throughput 


Figure  7:  Scheduler  Throughput  VS  utilization  p  (pback  <  !)• 


Scheduler  Throughput 


Figure  8:  Scheduler  Throughput  VS  background  utilization  pback- 
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Jitter  Variance 


Utilization 


Figure  9:  Variance  VS  background  utilization  pback  {pback  <  !)• 


Jitter  Variance 
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Utilization 


Figure  10:  Variance  VS  background  utilization  pback  (ptack  >  1)-  Pb  =  0-12  +  (fc  1)  *  0.02  where  k  the 
point  of  interest  (k  —  1,2,...  13) 


Figure  11:  Low  link  utilization:  Policies  behave  identically  ( pback  <  1) 
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Jitter  PMF  tor  Dynamic  Policy  a 


Figure  12:  High  link  utilization:  Jitter  perfoi 

(Pback  <  1) 


Figure  13:  Pr{Xk  <  k}  for  k  < 


Figure  14:  Pr{Xk  >  k}  for  k  > 
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dynamic  Policy  -  Back  Util  0.65 
Static  Policy  —  Back  Util  0.55 
Dynamic  Policy  —  Back  Util  0.85 
Static  Policy  —  Back  Util  0.85 


Tail  Distributions  tor  Dynamic  Policy  and  Static  Policy 


Figure  15:  Pr{Xk  <  k}  for  k  <T,  and  T  =  10  {pback  <  1) 


Figure  16:  Pr{Xk  >  k}  for  k  >  T,  and  T  —  10  ( Pback  <  1) 


Jitter  PMF  for  Dynamic  Policy  and  Static  Policy 


Figure  17:  Jitter  distributions  under  moderate  overload  ( pback  >  1) 
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Jitter  PMF  tor  Dynamic  Policy  and  Static  Policy 


ire  18:  Jitter  distributions  under  excessive  overload  (pback  >  1) 


Figure  19:  Pr{Xfc  >  k}  for  k  >  T,  and  T  =  5  ( pback  >  1) 


k 


Figure  20:  Pr{A'^  >  k}  for  k  >  T,  and  T  =  10  ( Pback  >  1) 
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Pit  Jitter  sij  Prf  Jitter  =  i] 


Empirical  Jitter  PMF  tor  Dynamic  and  Static  Policy:  Source  -  3 


Figure  21:  Empirical  PMFs  -  l5t  simulation  senarion 
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Empirical  Jitter  PMF  tor  Dynamic  and  Static  Policy:  Source  -  5 


Figure  22: 


Empirical  PMFs  -  1st  simulation  senarion 
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Abstract 

In  this  work,  the  problem  of  scheduling  packets  with  a  group  as  op¬ 
posed  to  an  individual  deadline  is  considered.  Packets  of  the  same 
group  are  supposed  to  belong  in  the  same  application  data  unit  for 
which  a  deadline  is  set.  and  they  are  assumed  to  arrive  over  fixed  time 
intervals  (frames).  The  Frame-based  Shortest  Time  to  Extinction  (F- 
STE)  scheduling  policy  is  considered  at  a  multiplexer  fed  by  multiple 
streams  with  group  deadlines;  expired  packets  are  dropped  (not  served). 
The  performance  of  the  F-STE  policy  is  evaluated  in  terms  of  the  in¬ 
duced  packet  loss  (deadline  violation)  probabilities.  It  is  established 
that  the  F-STE  policy  outperforms  the  similarly  simple  First  In  First 
Out  ( FIFO )  policy,  as  well  as  the  potentially  more  complex  FIFO  policy 
which  identifies  and  drops  expired  packets. 


1  INTRODUCTION 

High  speed  networks  are  today  considered  to  be  capable  of  supporting 
high-rate  real-time  applications  such  as  video  and  multimedia.  In  order 
for  the  Quality  of  Service  (QoS)  of  such  applications  to  be  delivered, 
networks  must  meet  stringent  performance  requirements  in  terms  of 
delay,  delay  jitter,  throughput  and  packet  loss.  The  timely  delivery  of 
packets  associated  with  a  real  time  application  can  be  achieved  in  a 
number  of  ways.  The  direct  approach  would  be  to  associate  deadlines 
by  which  the  packets  must  be  transmitted  and  employ  deadline-based 
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scheduling  mechanisms  within  the  network.  Examples  of  such  schedul¬ 
ing  policies  are  the  Earliest  Due  Date  (EDD)  (Liu  et  al  1973)  (Lim 
et  al.  1990)  (Saito  1990),  the  Delay-EDD  (Ferrari  et  al  1990)  and  the 
Shortest-Time-to- Extinction  (STE)  (Panwar  et  al  1988).  Alternatively, 
scheduling  policies  which  control  the  induced  delay  jitter  or  guarantee 
a  certain  transmission  (service)  rate  could  be  employed.  Such  policies 
include  the  Virtual  Clock  (VC)  (Zhang  1990)  (Lam  et  al  1995),  Hi¬ 
erarchical  Round  Robin  (HRR)  (Kalmanek  et  al  1990),  Stop-and-Go 
(Golestani  1990),  Rate  Controlled  Static  Priority  Queueing  (RCSP) 
(Zhang  et  al  1993).  Fair  Queueing  (Demers  et  al  1989),  Weighted  Fair 
Queueing  (WFQ)  (Takagi  et  al  1991),  also  known  as  General  Processor 
Sharing  (GPS),  and  the  packet  based  version  of  GPS.  Packet-by-Packet 
Generalized  Processor  Sharing  (PGPS)  (Parekh  et  al  1993)  (Parekh  et 
al  1994).  Since  this  paper  proposes  and  investigates  a  new  deadline 
driven  scheduling  policy,  the  related  EDD  and  STE  policies  mentioned 
above  are  discussed  in  more  detail. 

The  classical  EDD  policy  minimizes  the  maximum  lateness  and  max¬ 
imum  tardiness  by  transmitting  packets  in  the  order  of  their  due  dates. 
The  lateness  of  a  packet  is  defined  to  be  the  difference  between  the  fin¬ 
ishing  t  ime  of  its  transmission  and  its  due  date.  Its  tardiness  is  given  by 
max{0.  lateness}.  The  STE  policy  is  very  similar  to  the  EDD  policy  in 
the  sense  that  the  packet  with  the  earliest  due  date  is  served  first.  How¬ 
ever.  it  differs  from  the  EDD  in  that  packets  which  miss  their  deadline 
(expired  packets)  are  not  transmitted.  It  has  been  established  that  the 
STE  scheduling  policy  is  optimal  in  the  sense  that  it  maximizes  the 
fraction  of  packets  transmitted  before  their  deadline  or,  equivalently, 
minimizes  the  deadline  violation  probability. 

In  some  applications,  such  as  packetized  voice  or  video,  packets  must 
be  transmitted  before  their  deadlines  expire  in  order  to  be  useful.  On 
the  other  hand,  these  applications  can  tolerate  the  loss  of  a  small  frac¬ 
tion  of  packets.  Clearly,  the  STE  scheduling  policy  is  optimal  for  such 
applications.  However,  implementation  of  the  STE  policy  requires  a 
time-consuming  sorting  process  in  order  to  identify  the  task  with  the 
earliest  deadline.  This  makes  the  STE  policy  too  complex  for  real-time 
realization  in  high  speed  networks.  For  this  reason,  the  simple  to  imple¬ 
ment  FIFO  policy  is  often  employed.  Although  the  performance  of  the 
FIFO  scheduling  policy  is  typically  inferior  to  that  of  the  STE  policy 
with  respect  to  the  induced  deadline  violation  probability,  it  is  easy  to 
establish  that  the  STE  policy  and  the  FIFO  policy  that  drops  expired 
packets  become  identical  if  all  packets  sharing  the  same  resource  have 
identical  time  to  extinction  upon  arrival.  In  such  environments,  the 
easy  to  implement  FIFO  which  drops  expired  packets  is  also  optimal 
in  the  sense  that  it  minimizes  the  deadline  violation  probability. 

The  nature  of  most  real-time  applications  suggests,  however,  that  a 
deadline  be  associated  with  an  application  data  unit  (group  of  pack¬ 
ets),  such  as  a  video-frame  in  video  applications,  rather  than  with  in¬ 
dividual  packets.  For  example,  in  the  VBR  video  encoder,  the  data  for 
one  horizontal  strip  of  blocks  is  assembled  into  a  single  self-contained 
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unit  which  is  transmitted  by  means  of  multiple  ATM  cells  (the  unit 
is  referred  to  as  a  video  packet)  (Verbiest  et  al  1989).  The  address  of 
the  strip  in  the  frame  provides  a  time  reference  for  decoder  synchro¬ 
nization.  For  such  applications,  a  deadline  should  be  associated  with 
each  data  unit  rather  than  with  each  packet.  Packets  belonging  in  the 
same  group  now  have  variable  times  to  extinction  upon  arrival,  and  the 
FIFO  policy  is  non-optimal  even  if  all  the  applications  have  identical 
QoS  requirements. 

In  this  work,  the  Shortest  Time  to  Extinction  (STE)  scheduling  pol¬ 
icy  is  considered  for  scheduling  packets  with  a  group  as  opposed  to 
individual  deadline.  Packets  of  the  same  group  are  defined  here  as  pack¬ 
ets  arriving  over  a  fixed  time  interval  (frame).  Packets  belonging  to  the 
group  with  the  earliest  deadline  (shortest  time  to  extinction)  are  served 
first  and  the  policy  is  called  Frame-STE  (F-STE)  due  to  the  central  role 
of  the  underlying  frame  in  determining  the  arrivals  and  the  associated 
deadlines.  It  is  easy  to  establish  that  the  F-STE  is  equivalent  to  a 
dynamic  Deadline-Ordered  Head-of-Line  (DO-HoL)  priority  schedul¬ 
ing  policy.  Clearly,  F-STE  is  optimal  in  the  sense  that  it  minimizes 
the  deadline  violation  probability.  While  a  scheduler  implementing  the 
STE  policy  must  search  for  the  packet  with  the  shortest  time  to  extinc¬ 
tion  every  time  it  serves  a  packet,  no  such  search  is  necessary  for  the 
F-STE  policy.  This  is  an  advantage  intrinsic  to  group-based  scheduling 
policies,  in  which  the  scheduling  priority  needs  to  be  updated  less  often 
than  in  a  packet-based  scheduling  policy,  and  has  been  pointed  out  by 
other  authors  (for  example,  (Lam  et  al.  1996)). 

In  an  environment  in  which  packets  must  be  transmitted  before  their 
deadlines  in  order  to  be  useful,  a  scheduling  policy  that  drops  expired 
packets  is  more  effective  than  the  one  which  serves  expired  packets,  in 
the  sense  that  a  larger  fraction  of  packets  can  be  transmitted  before 
their  deadlines.  In  order  for  expired  packets  to  be  dropped  in  a  FIFO 
scheduler,  they  typically  need  to  be  time-stamped  and  searched  for, 
which  adds  significantly  to  the  implementation  complexity.  In  this  pa¬ 
per.  an  implementation  of  the  F-STE  scheduler  is  outlined  which  does 
not  require  time-stamping  and  searching  mechanisms.  The  performance 
of  the  F-STE  policy  is  compared  with  that  of  the  easy  to  implement 
(standard)  FIFO  policy  which  does  not  drop  expired  packets,  as  well 
as  the  more  complex  FIFO  policy  which  drops  expired  packets. 

Finally,  it  should  be  noted  that  the  non-work-conserving  version 
of  the  F-STE  policy  -  considered  also  in  this  paper  -  resembles  the 
Stop-and-Go  scheduling  policy  (Golestani  1990).  However,  the  objec¬ 
tives,  the  evaluated  performance  measures  of  interest  and  the  work- 
conserving  version  of  the  F-STE  policy  considered  in  this  paper  are 
different  from  those  in  (Golestani  1990). 
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2  DESCRIPTION  OF  THE  SYSTEM  AND  THE 
SCHEDULING  POLICY 

A  network  node  with  N  incoming  and  one  outgoing  links  is  considered 
in  this  paper.  All  links  are  assumed  to  have  the  same  capacity  and  thus 
a  packet  (fixed  size  information  unit)  transmission  requires  the  same 
amount  of  time,  referred  to  as  the  slot.  Slot  level  system  synchronization 
is  assumed  implying  that  packet  arrivals  and  departures  from  the  node 
occur  at  slot  boundaries. 

In  addition  to  the  time  slot,  a  larger  time  constant  of  length  T  (in 
slots),  called  frame,  is  associated  with  each  of  the  input  streams.  T  takes 
integer  values  and  is  assumed  to  have  the  same  value  for  all  streams  to 
simplify  the  analysis  and  the  discussion.  The  relative  shift  of  the  frame 
boundaries  of  the  incoming  streams  can  be  arbitrary.  Without  loss  of 
generality  and  for  simplicity,  the  frame  boundaries  are  assumed  to  be 
uniformly  distributed  in  this  paper.  Packet  arrivals  associated  with  the 
same  stream  and  frame  occur  according  to  a  Bernoulli  process  with  a 
fixed  rate.  This  rate  may  be  different  for  different  streams. 

The  frame  is  also  assumed  to  modulate  the  QoS  of  the  associated 
packets.  Specifically,  all  packets  of  a  stream  arriving  over  the  same 
frame  are  assumed  to  have  the  same  deadline,  set  to  be  equal  to  the 
end  of  the  next  frame.  Since  the  objective  of  this  work  is  to  deter¬ 
mine  whether  significant  performance  gain  can  be  achieved  under  the 
proposed  policy,  the  deadline  is  set  so  that  the  analysis  complexity  be 
minimized.  Packets  which  are  not  transmitted  (served)  by  their  dead¬ 
line  are  dropped. 

It  is  easy  to  see  that  packets  with  the  same  deadline  (end  of  next 
frame)  will  have  different  times  to  extinction  upon  arrival,  since  their 
arrival  times  are  different  (one  packet  per  slot  per  frame).  When  the 
frame  boundaries  associated  with  different  streams  are  not  synchro¬ 
nized.  which  would  be  typically  the  case,  it  is  easy  to  establish  that 
earlier  arriving  packets  from  one  stream  may  have  a  larger  time  to  ex¬ 
tinction  than  later  arriving  packets  from  a  different  stream.  As  a  result, 
the  easy  to  implement  FIFO  policy  would  not  serve  packets  in  order  of 
decreasing  times  to  extinction  and,  for  this  reason,  would  not  minimize 
the  deadline  violation  probabilities;  as  said  earlier,  these  probabilities 
are  minimized  under  the  STE  policy. 

In  this  paper,  packets  are  served  according  to  the  STE  policy.  Be¬ 
cause  of  the  frame-structured  packet  arrivals  and  QoS  definition,  the 
STE  policy  -  called  in  this  environment  Frame-based  STE  (F-STE)  - 
is  easy  to  implement,  and  thus,  can  be  employed  in  a  high-speed  net¬ 
working  environment.  Details  of  a  simple  implementation  are  presented 
in  the  next  subsection. 

The  following  definitions  and  alternative  policy  description  will  be 
useful  in  both  the  analysis  and  the  implementation  of  the  F-STE  pol¬ 
icy.  Figure  1  depicts  the  arrival  axes  of  A’  =  4  streams.  The  frames  of 
all  streams  are  identical  (of  length  equal  to  T  slots)  and  their  bound¬ 
aries  are  uniformly  distributed.  Let  { t *}  denote  the  sequence  of  frame 
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boundaries  from  any  streams,  that  is,  tk  denotes  the  time  when  the  kth 
frame  ends.  The  deadline  of  packets  associated  with  the  frame  ending 
at  tk  is  time  tk+N  •  Let  Ik  denote  the  stream  whose  frame  ends  at 
Ike{0A,2,...,N  -  1}. 


Stream  0 


Stream  1 


Stream  2 
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Figure  1  Frame  structure  of  N  =  4  traffic  streams 

According  to  the  F-STE  policy  and  in  view  of  the  above  assump¬ 
tions.  the  scheduler  serves  the  streams  according  to  the  ordered  HoL 
priority  (Ik+\ ,  h+2,  •  •  • ,  h+N)  over  the  interval  (tk,  2/c+i].  That  is,  high 
priority  is  given  to  packets  from  stream  /*+i,  second  highest  priority 
is  given  to  the  packets  from  stream  Ik+ 2,  etc.  It  is  easy  to  see  that 
this  policy  serves  packets  in  order  of  decreasing  times  to  extinction. 
The  ordered-HoL  priority  (Ik+\ ,  /*+ 2?  -  *  • ,  Ik+ N )  is  cyclically  shifted  at 
the  next  frame  boundary  to  (/*+2i  /jt+3, - /^+at+i);  and  for  this  rea¬ 

son.  the  comprehensive  service  policy  is  called  Dynamic-Ordered  HoL 
priority  policy  (DO-HoL). 


2.1  Implementation 

The  F-STE  policy  (or  DO-HoL  policy  which  drops  expired  packets) 
can  be  implemented  without  requiring  a  search  for  the  packet  with  the 
shortest  time  to  extinction,  as  described  below  and  depicted  in  Figure 
2. 

Each  stream  is  assigned  a  logically  distinct  data  queue  of  capacity 
T  which  is  the  maximum  number  of  packets  delivered  by  the  stream 
over  one  frame.  At  time  tk,  the  content  of  the  data  queue  Ik  is  shifted 
to  the  service  queue  of  capacity  T.  If  the  service  queue  overflows,  the 
dropped  packets  will  be  precisely  the  ones  which  will  miss  their  dead¬ 
line.  Packets  accepted  by  the  service  queue  will  be  served  within  T  time 
units  after  their  shifting  since  the  service  queue  is  served  continuously. 
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One  data  queue  for 
each  of  N  connections 
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Figure  2  Implementation  of  the  F-STE  policy  at  a  node  with  N  in¬ 
coming  links  (packets  arriving  over  a  frame  accumulate  in  the  corre¬ 
sponding  data  queue  and  are  shifted  to  the  service  queue  at  the  end  of 
the  frame). 


If  the  service  queue  becomes  empty  at  some  time  f,  4  <  t  <  4+ n 
then  the  server  either  remains  idle  until  data  queue  4+1  is  shifted  into 
the  service  queue  at  4+ 1  (non-work-conserving  F-STE  policy),  or  the 
server  attends  to  the  data  queues  according  to  the  DO-HoL  priority 
policy  (work-conserving  F-STE  policy). 


3  PERFORMANCE  ANALYSIS 

In  this  section,  the  packet  loss  (deadline  violation)  probability  is  derived 
for  A  streams  served  under  the  F-STE  policy.  The  arrival  process  and 
QoS  requirements  are  as  described  earlier. 

Let  Tk  denote  the  interval  (in  slots)  between  the  two  consecutive 
frame  boundaries  ending  at  4  and  4+1 ;  that  is  r*  =  4+1  —4**  In  view  of 
the  assumptions  in  section  2,  r*  is  independent  of  k  and  will  be  denoted 
by  r.  Clearly,  r  =  j*.  Let  4  denote  the  data  queue  whose  content  is 
to  be  shifted  to  the  service  queue  at  4*  Let  Qk  denote  the  service 
queue  occupancy  at  time  4,  after  the  packets  from  data  queue  4  are 
shifted  into  the  service  queue.  The  2-dimensional  process  {Qk,h}k>o 
with  state  space  {(i,j)  :  0  <  i  <  T,0  <  j  <  N  -  1}  is  defined  to  be 
the  system  process  and  is  employed  in  the  analysis  of  the  scheduling 
policies.  Let  P(i,j,  i9,j9)  denote  the  transition  probability  that  process 
{Qk-  h}k>o  moves  from  state  (i,j)  at  time  4  to  state  at  time 

4+i  •  Note  that  this  probability  is  zero  if  j'  ^  (j  +  l)mod(Ar),  since  only 
transitions  from  j  to  j 1  =  ( j  +  l)mod(Ar)  are  possible.  Throughout  the 
paper,  j'  will  always  be  equal  to  ( j  +  l)mod(Ar). 

In  the  next  subsection,  the  system  process  {Qk*  4}fc>o  is  determined 
under  the  non-work-conserving  F-STE  policy.  In  subsection  3.2,  certain 
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auxiliary  processes  are  defined  in  order  to  obtain  bounds  or  approxi¬ 
mations  on  the  evolution  of  the  system  process  {Qk,h}k> o  under  the 
work-conserving  F-STE  policy.  By  employing  the  derivations  of  sub¬ 
sections  3.1  and  3.2,  the  packet  loss  (deadline  violation)  probability  is 
derived  for  all  cases  in  subsection  3.3.  In  the  derivations  that  follow, 
A ^  denotes  the  random  variable  representing  the  number  of  packets 
from  stream  k  arrived  over  m  slots.  The  probability  mass  function  of 
A ^  is  given  by  the  convolution  of  the  Bernoulli  random  variables  of  the 
associated  stream  which  are  assumed  to  be  of  fixed  value  over  a  frame. 


3.1  System  equations  for  the 

non-work-conserving  F-STE  policy 

Under  this  policy,  packets  can  be  served  only  after  they  have  been 
shifted  to  the  service  queue,  that  is,  only  after  the  end  of  the  frame 
over  which  they  arrive.  The  server  remains  idle  when  the  service  queue 
is  empty  even  if  the  data  queues  may  be  non-empty.  This  non-work- 
conserving  version  of  the  F-STE  policy  is  considered  primarily  to  facili¬ 
tate  the  study  of  the  F-STE  policy  presented  afterwards.  Nevertheless, 
this  policy  may  be  employed  for  applications  in  which  the  packets  of  the 
frame  cannot  be  served  before  the  packets  of  the  entire  frame  have  been 
received.  It  is  easy  to  establish  that  the  system  process  {Qk,  Ik}k> o  is  a 
Markov  chain,  since  Ik  is  a  periodic  Markov  chain  and  Qk+ 1  is  (proba¬ 
bilistically)  determined  from  Qk  and  A The  probability  P(i,  j,  i\j') 
that  the  Markov  chain  moves  from  state  (i,j)  to  state  (i'.f)  is  given 
below.  When  i  >  r,  the  system  queue  cannot  become  empty  between 
two  successive  shifts  to  the  service  queue,  r  slots  apart;  when  i  <  r,  the 
service  queue  becomes  empty  between  two  successive  shifts.  Note  that 
since  only  one  packet  can  arrive  over  one  slot,  the  maximum  number 
of  packets  arriving  over  T  time  slots  (one  time  frame)  is  T. 


Case  A  :  r  <  i 


For  (i-r)<i'<T-  1:  P(i,j,i,,j')  =  P{A]j  =  i,-i  +  T}  (1) 

X 

For  i'  =  T:  P(i,j,i’,j')=  £  P{A$  =  k}  (2) 

k=T-i+r 


Case  B  :  0  <  i  <  r 


For  0  <  z'  <  T  :  P(iJ,i'J')  =  P{A%  =  i'} 


(3) 
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3.2  Auxiliary  system  equations  for  the 
work-conserving  F-STE  policy 

According  to  the  work-conserving  F-STE  policy,  if  the  service  queue 
becomes  empty  at  some  time  t,  tk  <  t  <  4+ 1,  the  server  attends  to 
the  data  queues  according  to  the  ordered  HoL  priority  (4+ 1 , . .  . ,  4+/v) 
(see  section  2).  An  implication  of  this  policy  is  that  the  system  process 
\Qkih}k> o  is  not  a  Markov  chain.  Some  of  the  arrivals  to  be  shifted 

into  the  data  queue  at  4+i>  given  by  Aj+1,  may  have  already  been 
served  while  in  the  corresponding  data  queue.  Since  the  latter  quantity 
is  needed  to  determine  the  evolution  of  {Qk,  h}k>o,  {Qk,h}k> o  is  not 
a  Markov  chain. 

In  order  to  obtain  the  system  evolution  for  the  work-conserving  case, 
the  Markovian  process  {Qk,h,Qk}k> o  can  be  considered,  where  Qk  is 
an  (Ar  —  l)-dimensional  vector  representing  the  occupancy  of  the  data 
queues  which  are  not  shifted  to  the  service  queue  at  time  4i  the  one 
shifted  is  always  0  at  4-  The  resulting  analysis  would  be  involved  and 
the  complexity  would  be  prohibitive  for  a  large  N ,  Nevertheless,  this 
approach  has  been  employed  in  the  study  of  the  work-conserving  F- 
STE  policy  supporting  N  =  2  streams.  The  derivation  of  the  system 
equations  for  this  case  is  presented  in  the  appendix.  Results  for  the  two 
stream  case  are  presented  at  the  end  of  the  paper. 

The  work-conserving  F-STE  policy  for  large  N  is  studied  here  by 
following  the  approach  described  below.  This  approach  leads  to  the 
derivation  of  bounds  and  approximations  on  the  performance  of  the 
work-conserving  F-STE  policy  with  a  complexity  similar  to  that  for 
the  non- work-conserving  case,  even  for  large  A\ 

Since  all  packet  losses  (deadline  violations)  are  “registered”  as  service 
queue  overflows  occurring  at  some  4  (when  the  corresponding  data 
queue  is  shifted  into  the  service  queue),  bounds  on  the  packet  loss 
under  the  work-conserving  F-STE  policy  can  be  obtained  by  defining 
proper  auxiliary  systems  U  and  L  which  are  such  that  their  service 
occupancies  at  times  4  "  denoted  by  Q k  and  Qlk  respectively  -  satisfy 

Qi  <  Qk  <  Qt-  (4) 

For  properly  selected  auxiliary  systems  (policies),  the  above  relation¬ 
ship  would  then  lead  to  a  similar  one  involving  the  corresponding  packet 
loss  probabilities  given  by 

4  <  Ik  <  l\.  (5) 

Furthermore,  by  constructing  the  auxiliary  system  in  a  way  that  the 
system  processes  posses  the  Markovian  property,  the  packet  loss  prob¬ 
abilities  will  be  easily  derived,  as  described  in  subsection  3.3. 

Since  the  non-work-conserving  F-STE  scheduler  remains  idle  when 
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the  service  queue  becomes  empty,  and  is  otherwise  identical  to  the 
work-conserving  F-STE  scheduler,  it  is  evident  that  the  associated  sys¬ 
tem  can  serve  as  the  auxiliary  system  V .  In  3.2.1,  an  auxiliary  system 
L  is  proposed,  and  equations  are  derived  for  the  system  evolution.  An¬ 
other  auxiliary  system  A  is  proposed  in  3.2.2,  in  order  to  obtain  a  close 
approximation  for  the  packet  loss  of  the  work-conserving  F-STE  policy. 
The  following  will  be  satisfied  by  this  auxiliary  system. 

Qi<Qak<Ql  L[  <  L%<  L\.  (6) 

The  tightness  of  the  bounds  and  the  accuracy  of  the  approximate  re¬ 
sults  are  discussed  in  the  last  section  where  numerical  results  are  pre¬ 
sented. 

Auxiliary  system  L 

The  auxiliary  system  L  is  defined  to  be  similar  to  the  system  under  the 
non-work-conserving  F-STE  policy  with  the  following  differences.  If  the 
service  queue  becomes  empty  before  the  next  data  queue  shifting  time 
j_j .  the  number  of  slots  wasted  under  the  non-work-conserving  F-STE 
policy  is  considered  as  a  service  credit  available  at  tk+i]  this  amount  is 
registered  as  an  equivalent  negative  contribution  to  the  service  queue 
occupancy  at  $*+!•  There  is  an  upper  bound  on  the  allowed  credit 
accumulation  equal  to  T  —  r.  This  bound  is  set  by  the  requirement  that 
earlier  service  credit  cannot  be  utilized  by  future  arrivals  since  it  will 
be  lost  ("expired  credit’1).  More  specifically,  service  queue  occupancy 
x  <  r  —  T  at  4  would  imply  the  generation  of  at  least  \x  —  (T  — r)|  credit 
units  during  the  interval  {tk-N,  4-(/v-i)];  this  credit  must  be  wasted 
since  all  packets  in  the  data  queues  at  have  arrived  after 

The  one-step  transition  probability  for  r  —  T  <  i  <  T  for  system  L 
is  given  by 


T 

For  i'  =  T :  P(iJ,i'J')  =  £  P{A$.  =  k}  (7) 

*=X-i+T 

For  (r  —  T)  <  /'  <  T  —  1  :  P{iJJ\f)  =  P{AJj  =  if-i  +  r}  (8) 

2  r—i—T 

For  i'  =  t  —  T  :  P(i,jJ',f)=  £  P{A%  =  k]  (9) 

k= 0 

Auxiliary  system  A 

In  deriving  the  evolution  of  system  X,  it  was  assumed  that  all  the 
"unexpired"  slots  (inducing  Uunexpired”  credit)  wasted  under  the  non- 
work-conserving  policy  could  be  utilized  by  serving  the  data  queues. 
This  is  true  only  if  at  least  one  data  queue  is  non-empty  during  the 
interval  from  when  the  service  queue  becomes  empty,  till  the  next  shift 
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to  the  service  queue.  In  order  to  obtain  a  better  approximation  for 
the  packet  loss  probability  of  the  work-conserving  F-STE  policy,  an 
auxiliary  system  A  is  considered,  in  which  a  more  precise  credit  accu¬ 
mulation  rule  is  applied.  As  in  the  case  of  the  auxiliary  system  Z,  the 
service  queue  capacity  i  must  satisfy  r  —  T  <  i  <  T. 

Case  A  :  il  >  0 

In  this  case,  no  service  credit  is  generated. 

For  (i-r)  <  i'  <  T-  1  :  P{i,j,i',  j')  =  P{A3j  =  i'-i  +  r}  (10) 

T 

For  ir  =  T  :  P(hj,i',j’)  =  E  PiAT  =  k)  (n) 

k=T-i+T 

Case  B  :  if  <  0 
[B 1]  :  i  >  0 

Unlike  in  system  L .  out  of  the  l  =  \i  —  r\  non-used  (by  the  packets  in 
the  service  queue)  service  opportunities  over  (4,4+iL  only  a  portion 
will  be  registered  as  credit,  based  on  the  number  of  (earlier)  packet 
arrivals  to  the  data  queues  which  have  not  been  shifted  to  the  service 
queue  at.  or  before,  4-  Thus,  the  actual  amount  of  credits  kept  will  be 

equal  to  mi»(|/|.  Am)  where  Am  =  +  A^^^A  . . .  +  A^+h  ~2 . 

Each  of  the  terms  in  Am  represents  packet  arrivals  over  some  r  and  for 
some  stream  which  have  not  been  shifted  to  the  service  queue  at,  or 
before.  4-.  Arrivals  over  the  interval  (4»<fc+i]  over  which  this  credit  is 
generated  are  not  considered  since  some  of  them  may  not  arrive  before 
the  credit  is  generated;  clearly,  this  is  an  approximation.  In  this  case, 
the  only  possible  transitions  are  to  state  where  —r  <  if  <  0,  and 
their  probabilities  are  given  by 

i’J')  =  P{AJj  =  i  -  i  +  T}P{A~m  >  |i'|)  + 

E  1  P{4  =  k}P{Am  =  \i'\}  (12) 

Ar=0 


[B  2]  :t  -T  <  ?  <  0 

The  service  queue  is  empty  at  4?  and  |i|  service  credits  have  already 
been  accumulated.  Let  l  =  if  —  i.  If  /  <  0,  |/|  is  equal  to  the  credit 
generated  over  (4,4+i]-  Since  packet  arrivals  to  the  data  queues  oc- 
curing  before  4-i  have  been  considered  toward  the  determination  of 
the  original  credit  |i|  (as  indicated  in  the  previous  case,  [Bl]),  only 
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packet  arrivals  to  the  data  queues  occuring  over  (tk-i,tk]  will  be  used 
to  determine  the  actual  new  credit  to  be  kept.  The  latter  arrivals  are 
given  by  An  =  AF+l  +  AF+2  +  ■  ■  ■  +  AF+N~2.  Finally,  no  credit  is 
generated  if  /  >  0.  The  transition  probabilities  are  given  by: 

For  t  —  T  <  i'  <  0  : 


P{Aj  =  r  +  /}  l  >  0 

P{A%  =  r  +  l}P{An  >  |/|}+ 

U=o~lf>{4  =  k}P{An  =  \l\}  l<  0 


For  i'  =  r  —  T  : 


P(i.j-i'j') 


f  P{A3j  =  r  +  /}  /  >  0 

I  U±l0P{4  =  k}P{An>\l\}  l<  0 


(13) 


(14) 


3.3  Computation  of  packet  loss  probabilities 

In  view  of  the  Markovian  structure  for  the  system  processes  defined  in 
the  previous  subsections,  the  induced  packet  loss  (deadline  violation) 
probabilities  are  easy  to  determine.  As  indicated  earlier,  the  packets 
whose  deadline  expires  are  precisely  the  packets  which  overflow  from 
the  service  queue  upon  shifting  to  that  queue. 

Let  Lij  denote  the  number  of  packets  dropped  over  the  interval 
(4.  tk+ 1].  where  (i:j)  is  the  system  state  at  4-  Notice  that  all  losses  will 
be  associated  with  packets  from  stream  jf  =  (j  +  l)mod(Ar).  Clearly, 

L,j  =  {Aj  +  (  /  —  r)  -  T)+.  (15) 

Notice  that  at  most  i  —  r  packets  may  be  lost  since  the  maximum 
number  of  per  stream  and  per  frame  arrivals  is  equal  to  T;  no  loss 
occurs  if  i  <  r.  The  average  value  of  Lij,  Lij .  is  given  by 

i—r 

Lij  =  J2kF(  4  =  T-(i-T)  +  k).  (16) 

h=  1 


The  loss  rate  for  stream  jf  =  ( j  +  l)mod(Ar)  over  (4i^+i]j  when  the 
system  state  at  4-  is  (L  j),  is  given  by 


(17) 
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where  A-7’  denotes  the  arrival  rate  of  stream  f  per  slot.  Finally,  the 
packet  loss  probability  is  given  by 

L=  £  (18) 

i=r+l  j= 0 

where  7 r(i.j)  is  the  steady  state  probability  distribution  of  the  corre¬ 
sponding  Markov  chain. 


4  NUMERICAL  RESULTS 

In  this  section,  two  sets  of  numerical  results  are  presented.  Figures  3  to  6 
are  derived  to  evaluate  the  accuracy  of  the  bounds  and  approximations 
associated  with  the  analysis  of  the  general  F-STE  policy,  as  well  as  to 
present  exact  results  for  the  non-work-conserving  F-STE  policy,  and 
the  work  conserving  F-STE  policy  with  N  =  2.  Figures  7  and  8  are 
derived  for  the  comparative  evaluation  of  the  F-STE  and  the  FIFO 
policies. 

The  exact  packet  loss  probability  vs  system  utilization  A  (for  T  =  8) 
for  X  —  2  streams  served  under  the  work-conserving  F-STE  are  shown 
in  Figure  3:  simulations  are  also  shown  for  verification  of  the  equation. 
Results  for  systems  U  (non- work-conserving  F-STE  policy)  and  L  are 
also  shown. 

Results  for  the  packet  loss  probability  for  N  =  4  streams  vs  system 
utilization  A  (for  T  =  8)  and  vs  frame  length  (for  A  =  0.88)  are  pre¬ 
sented  in  Figures  4  and  5.  respectively.  Results  are  shown  for  systems 
r .  .4.  and  simulations.  For  the  cases  considered  here,  the  approximate 
system  (system  A)  seems  to  provide  for  an  accurate  approximation 
of  the  performance  of  the  work-conserving  F-STE  policy.  The  lower 
bound  (auxiliary  system  L )  is  observed  to  be  tighter  than  the  upper 
bound  (auxiliary  system  U ,  that  is  the  non-work-conserving  policy). 
As  A  increases,  the  tightness  of  the  lower  bound  improves  since  more  of 
the  unexpired  credit  in  system  L  will  actually  be  used  under  the  work- 
conserving  F-STE  policy.  Similarly,  the  tightness  of  the  upper  bound 
(non-work-conserving  system)  improves  as  A  increases  since  the  server 
will  be  found  idle  less  often  and,  thus,  the  work-conserving  and  non- 
work-conserving  policies  will  tend  to  become  identical.  As  T  increases, 
the  tightness  of  the  upper  bound  deteriorates  since  the  work-conserving 
and  non-work-conserving  F-STE  policies  become  increasingly  different 
and  the  server  is  found  more  frequently  idle  under  system  U  while  there 
is  work  in  the  system.  The  slight  improvement  of  the  tightness  of  the 
lower  bound  as  T  increases  may  be  attributed  to  an  increasing  proba¬ 
bility  that  a  data  queue  is  non  empty  and  thus  increasing  amount  of 
credit  is  kept.  The  impact  of  the  number  of  streams  N  on  the  induced 
packet  loss  probability  is  illustrated  in  Figure  6  vs  frame  length  T  (for 
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A  =  0.88);  the  results  are  for  the  work-conserving  policy  (N  =  2,  exact 
analysis)  and  for  the  approximate  system  A  {N  =  4,8, 16). 

Simulation  results  for  the  induced  packet  loss  probability  under  a 
simple  FIFO  policy,  the  FIFO  policy  in  which  expired  packets  are 
dropped,  and  the  work-conserving  F-STE  policy  are  shown  in  figures 
7  and  8  vs  system  utilization  A  (for  T  —  16  and  N  =  4)  and  vs  frame 
length  T  (for  A  =  0.88  and  N  =  4),  respectively.  The  results  from  the 
analysis  for  the  approximate  system  A  are  also  shown.  All  results  are 
derived  for  symmetric  load. 

As  expected,  the  packet  loss  probability  of  all  three  scheduling  poli¬ 
cies  decreases  as  the  system  utilization  decreases.  The  packet  loss  prob¬ 
ability  also  decreases  as  the  frame  length  increases.  The  latter  may  be 
attributed  to  the  associated  increase  -as  the  frame  length  increases- 
in  the  time  to  extinction  as  well  as  the  increased  smoothness  of  the 
cumulative  traffic  over  a  frame. 

The  FIFO  policy  which  drops  expired  packets  is  seen  to  induce  a 
smaller  packet  loss  probability  than  the  standard  FIFO  policy,  as  ex¬ 
pected.  The  advantage  is  more  pronounced  when  the  scheduler  is  oper¬ 
ating  nearly  at  maximum  capacity  where  the  system  utilization  is  high 
and  the  time  to  extinction  is  short  (that  is,  the  frame  length  is  short). 

As  expected,  the  work-conserving  F-STE  policy  outperforms  both 
FIFO  policies.  It  should  be  noted  that  the  F-STE  scheduler  is  com¬ 
parable  in  terms  of  implementation  complexity  to  the  scheduler  that 
implements  the  standard  FIFO  policy  (which  does  not  drop  expired 
packets).  Comparison  of  these  two  policies  (Figures  7  and  8)  shows 
that  the  F-STE  policy  always  outperforms  the  FIFO  policy  by  a  large 
margin,  irrespective  of  frame  length  and  system  utilization. 

Comparison  of  the  F-STE  policy  with  the  more  complex  FIFO  policy 
which  drops  expired  packets  reveals  that  for  high  system  utilization 
(Figure  7)  the  performance  difference  between  F-STE  and  FIFO  is 
small  due  to  the  scheduler  throughput  limitation.  As  A  decreases,  the 
sub-optimality  of  the  FIFO  becomes  a  factor  with  increasing  weight  in 
inducing  losses.  The  difference  in  performance  between  the  above  two 
policies  is  seen  to  increase  as  the  frame  length  increases  (Figure  8). 
This  may  again  be  attributed  to  increased  sub-optimalitv  of  the  FIFO 
policy:  as  the  frame  length  increases,  the  range  of  extinction  times  of 
packets  of  the  same  frame  increases  leading  to  increased  sub-optimality 
of  the  FIFO  policy. 

Evidently,  the  performance  difference  between  the  two  scheduling 
policies  that  drop  expired  packets  decreases  under  conditions  (small 
frame  length,  high  system  utilization)  in  which  the  packet  loss  is  largely 
due  to  the  scheduler  limitation,  rather  than  the  sub-optimality  of  FIFO. 
The  performance  of  the  standard  FIFO  policy  remains  significantly 
worse  even  under  these  conditions;  this  may  be  attributed  to  the  addi¬ 
tional  sub-optimality  of  this  policy  resulting  from  transmitting  packets 
that  have  exceeded  their  deadlines  which  becomes  significant  under 
these  conditions. 
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APPENDIX 

A  System  evolution  for  2  stream  case  (exact 
analysis) 

The  accurate  calculation  of  the  packet  loss  probability  for  the  F-STE 
policy  in  the  case  of  a  node  with  two  streams  is  described  in  this  ap¬ 
pendix.  As  mentioned  in  subsection  3.2.  the  Markov  process 
{Qk*hiQk}k>o  (denoted  as  process  Mi)  embedded  at  {4}fc>o  needs 
to  be  considered.  Qk  denotes  the  service  queue  occupancy  after  pack¬ 
ets  have  been  shifted  in  from  data  queue  4  at  tk.  Qk  is  an  (N  —  1)- 
dimensional  vector  representing  the  occupancy  of  the  data  queues  at 
time  4  which  have  not  been  shifted  to  the  service  queue;  that  is, 

Qk  ~  (Q^+1 .  Q^+2. ....  Q^fc+N”1),  where  Qk  denotes  the  occupancy  of 

data  queue  j  at  tk.  Notice  that  Q!kk  =  0  since  this  data  queue  is  being 
emptied  (shifted  to  the  service  queue)  at  4  and  thus  there  is  no  need 
to  include  it  in  the  (N  —  1  )-dimensional  process.  The  state  space  of  M\ 
for  A*  =  2  is  given  by  S2  =  {{hjJ)  ’  ^  <  i  <  T.O  <  j  <  N  —  1^0  < 
I  <  r}.  Let  P(i*jJ:if'!j/,lf)  denote  the  probability  that  Markov  chain 
Mi  moves  from  state  at  4  to  state  (ir,j\lr)  at  4+1?  (hdJ)* 

(/'./,  l')cS‘2.  These  probabilities  are  described  below  where  different 
cases  are  considered  based  on  the  starting  state  of  the  service  queue 

«?*  =  /). 

Case  A  :  r  <  ?  (service  queue  will  not  become  empty  before  4+ 1 ) 

For  ( /  —  r)  <  /'  <  T  —  1  :  P(z,4  C  4  j7.  l')  — 

P{AJT  =  i'-i  +  r}P{AJT  =  l'}  (19) 

For  i'  =  T  :  P(iJ,  /;  t'J',  /')  = 

£  PiAT  =  k)P{A{  =  0  (20) 

A:=T — i+r 

Case  B  :  0  <  i  <  r  (service  queue  will  become  empty  before  4+i ) 

In  this  case,  the  service  queue  becomes  empty  at  time  4  +  i,  and  re¬ 
mains  empty  (for  a  period  of  r  —  i  slots)  till  the  next  shift  to  the  service 
queue  at  4+i-  Over  the  interval  (4  +  i,4+i]*  packets  are  served  from 
the  data  queues  according  to  the  HoL  priority  (4+1,4+21  = 

Two  cases  need  to  be  considered  depending  on  whether  il  =  0  or  i*  >  0 
as  explained  below. 
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Case  B.l  :  0  <  i  <  r,  if  >  0 

In  this  case,  data  queue  4+1  —  jl  is  always  nonempty  over  ( 4  + 
i,tk+ 1]  since  otherwise  i*  =  0.  The  latter  is  seen  by  noting  that  data 
queue  jf  will  remain  empty  after  it  becomes  empty  for  the  first  time 
within  the  interval  (tk  +  i,4+i]  ”  and,  thus,  if  =  0  -  since  stream  j* 
generates  at  most  one  packet  per  slot  and  receives  HoL  priority  over 
the  interval  (tk  +  i,4+i]*  As  a  result,  no  other  data  queue  receives 
any  service  over  this  interval  when  V  >  0  and  the  transition  probabili¬ 
ties  of  M\  depend  only  on  the  cumulative  arrivals  over  T  for  stream  f 
and  r  for  stream  j,  and  they  are  identical  to  those  under  Case  A  above. 

Case  B.2  :  0  <  t  <  r,  if  =  0 

In  this  case,  data  queue  4+i  =  f  becomes  empty  at  some  slot  in 
{tk  +  4+ 1]  (and  remains  empty  thereafter).  During  the  slots  in  which 

data  queue  4+i  =  j*  is  empty  and  no  packet  is  generated  by  stream  f 
(which  would  be  served  under  the  HoL  priority),  the  other  queue  will 
be  served  if  nonempty.  In  order  to  determine  the  content  of  data  queue 

j  at  4+i .  to  determine  the  state  of  Qk+\  =  Q3k +1  (since  4+i  =  f  and 

j  =  (f  +  1)  mod  (2)),  the  slot  by  slot  evolution  of  data  queue  j  needs 
to  be  followed  over  the  interval  (ijt  +  i,4+i].  Since  the  evolution  of  this 
queue  depends  on  whether  data  queue  jf  is  empty  or  not,  the  evolution 
of  both  queues  over  (4  +  L4+i]  is  considered  bv  using  the  auxiliary 
Markov  chain  defined  next. 

Let  (QJ0  ,  QJ0)  denote  the  occupancies  of  the  corresponding  data  queues 

when  the  service  queue  becomes  empty  (at  4  +  /).  Let  {QJn  ,QJn}n> o  de¬ 
note  the  data  queue  occupancy  process  embedded  at  the  slot  boundaries 

with  initial  state  (QJ0,QJ0).  This  process  (referred  to  as  process  A/2) 
evolves  as  a  Markov  chain  over  the  interval  (tk  +  L4+i]  and  its  state 

after  r-i  transitions,  is  such  that  :  (Qk+i,h+\,Qk+i)  = 

(Qi-irj'-  Qr-i)-  That  is.  the  (r— i)-step  transition  of  M2  will  determine 
the  state  of  M\  at  4+1  •  The  1-step  and  m-step  transition  probabili¬ 
ties  of  M2  are  derived  below.  Let  the  state  space  of  M2  be  defined  by 

^2  =  {(<7 i  •  0  <  ql  <  T,  0  <  q-l  <  r}.  Let  A3  denote  the  Bernoulli 
random  variable  describing  the  number  of  packets  generated  over  a  slot 
by  stream  j. 

For  0  <  Q3n  <  T,  data  queue  jf  will  be  served  over  the  current  slot  and 
thus  the  evolution  of  the  queues  is  given  by 

Qn+l  =  Qil-l  +  AJ', 

QJn+l  =  Qi+Ai. 


(21) 

(22) 
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Thus,  the  1-step  transition  probability  of  M2  is  given  by 

PM2  (<7n  »  «7n ;  9n+l  ’  ?n+l  )  =  Pi ^  =  #+1  ~  <?n  +  1 } 

(23) 

for  0  <  qi'  <T,  0  <  ft'+1  <  T,  0<ft,#+1<r. 

For  =  0.  data  queue  j  will  be  served  if  nonempty  and  no  packet  is 
generated  by  stream  j1  over  the  current  slot.  Thus  the  evolution  of  the 
queues  is  given  by 

C,  =  o.  <24> 

QUi  =  (Qi  -  (25> 

Thus,  the  1-step  transition  probability  of  M2  is  given  by 
^A/2(Mn-Mn+l)  =  PiAJ  =  1)P{A3  =  qJn+ 1  -9n)  + 

P{Aj'  =  =  q{+l  -  qi  Hr  1}  (26) 

for  0  <  qJn  <  t.  0  <  qJn+]  <  t,  and 

Pa/2(0.():0.c/;'+1)  =  P{AA  =  l}P{Aj  =  <&+1 }  + 

=  °>V„+1=o}  (2?) 

for  0  <  </''+1  <  1. 

The  m-step  transition  probability  of  M2  can  be  derived  recursively  in 
terms  of  the  (m  -  l)-step  and  the  1-step  transition  probabilities  and  it 
is  given  by 


PFl2(qn^ni<In+mi€+m)  = 

T.  y  PFl2  1  (9n  >  qJn'i  9n+m-l  )  qii+m-l ) 

m  —  1  =0  ^n  +  m-l- 0 

PM2(Qn+m- 1  >  9n+m-l  ’  Qn+m  ’  9n+m  ) 

0  <  (iJn.<in+m-v€+m  <T,  0<  C  #+m-l  ^9n+m  <  T. 


(28) 
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Finally,  the  transition  probabilities  of  process  for  case  B/2  are  given 

by 


p(i,j,ho,j\n  =  E  E  piAi  =  ww  = k 2} 

fcl=0  k2=0 

PJ£\l  +  kuk2;0,l%  (29) 

where  Aj  denotes  the  number  of  packet  arrivals  from  stream  j  over  i 
slots  with  AJ0  =  0. 


B  Computation  of  packet  loss  probability 

The  loss  probability  can  be  calculated  by  following  a  similar  approach 
as  in  subsection  3.3.  Let  ( i,j,l )  be  the  system  state  at  time  4-  The 
number  of  packet  losses  from  stream  j'  =  (j+l)mod(2)  over  the  interval 
(4-U+ 1]  is  given  by 


Ll}l  =  (4  +  (i-T)-T)+.  (30) 

Notice  that  only  packets  from  stream  j'  may  expire  over  this  interval. 
The  average  value  of  Liji  is  given  by 


Liji  =  £  kP(4  =  T  -  (i  -  r)  +  k).  (31) 

t=i 

The  loss  rate  for  stream  j '  over  (tk,tk+ 1],  when  the  system  state  at  4 
is  (i.j.  /).  can  now  be  written  as 


Rijl  = 


ki.il 
A  >'7" 


The  total  loss  probability  is  therefore 


(32) 


l=  E  EEvo-i’ 

t=r+ 1  j»=0  /=U 


(33) 


where  7 r(i,jj)  is  the  steady  state  probability  distribution  of  the  Markov 
chain  i\l\. 
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Figure  3  Analytical  results  for 
loss  probability  vs  system  utiliza¬ 
tion  for  N  =  2  and  T  =  8. 
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Figure  4  Analytical  results  for 
loss  probability  vs  system  utiliza¬ 
tion  N  =  4  and  T  —  8. 
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Figure  5  Analytical  results  fc. 
loss  probability  vs  frame  length  N 
=  4  and  system  utilization  equal 
to  0.88. 
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Figure  6  Analytical  results  for 
loss  probability  vs  frame  length, 
for  different  values  of  N  given  by 
system  A  for  system  utilization 
equal  to  0.88. 
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Figure  7  ^oss^probabilitv  vs  sys¬ 
tem  utilization  for  F-STE  and 
FIFO  for  Ar  =  4  and  T  =  16. 
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Figure  8  Loss^robability  versus 
frame  length  for  F-STE  and  FIFO 
for  Ar  =  4  and  system  utilization 
equal  to  0.88. 
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Abstract 

Since  delay  tolerable,  the  ABR  applications  can  be  allocated 
the  remaining  resources  after  CBR  ( Constant  Bit  Rate )  and 
VBR  (Variable  Bit  Rate)  applications  have  been  accommo¬ 
dated.  To  avoid  excessive  losses ,  the  transmission  rates  of 
the  ABR  applications  should  be  modulated  by  the  amount  of 
remaining  resources .  That  is,  the  ABR  rates  should  be  con¬ 
trolled  through  a  feedback  based  rate  control  mechanism.  In 
this  paper .  a  network  link  shared  by  remote  ABR  and  VBR 
applications  is  considered  and  the  impact  of  various  system 
time  scales  on  the  effectiveness  of  the  feedback  based  flow 
control  scheme  is  investigated.  These  time  scales  are  ex¬ 
pressed  in  terms  of  the  network  transmission  speed,  the  min¬ 
imum  tolerable  ABR  rate  and  the  rate  of  change  of  the  VBR 
source  rate.  While  the  negative  impact  of  a  decreased  net¬ 
work  time  scale  on  the  effectiveness  of  this  control  scheme  is 
well  known,  the  impact  of  the  ABR  and  VBR  time  scales  has 
not  been  investigated  in  the  past.  It  turns  out  that  for  a  given 
network  time  scale .  the  induced  cell  losses  can  be  significantly 
reduced  for  increased  ABR  and/or  VBR  time  scales  and  thus, 
the  latter  time  scales  should  be  taken  into  consideration  when 
evaluating  the  effectiveness  of  an  adaptive  feedback  based  rate 
control  mechanism. 


1  Introduction 

As  indicated  by  the  ATM  Forum,  the  ABR  service  class  al¬ 
lows  non-real  time,  delay  tolerant  traffic  sources  to  share  the 
available  bandwidth  not  utilized  by  the  VBR  and  CBR  ser¬ 
vice  classes.  Due  to  the  fluctuations  in  the  bandwidth  usage 
of  VBR/CBR  sources,  the  ABR  source  rates  should  be  con¬ 
trolled  through  a  feedback  based  rate  control  mechanism. 

When  the  ABR  source  receives  the  feedback  generated 
by  the  receiving  end  or  a  network  node,  it  adjusts  its  rate 
to  the  current  available  bandwidth  according  to  the  traffic 
control  policy  in  effect  [1],  [2],  [3].  Several  traffic  control 
policies  have  been  proposed  for  the  rate  based  flow  control 
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Project  Agency  under  Grant  F49620-93-1-0564  monitored  by  the  Air 
Force  Office  of  Scientific  Research  (AFOSR)  and  the  National  Science 
Foundation  under  Grant  NCR-9628116. 


such  as  the  Explicit  Forward  Congestion  Indication  (EFCI) 
[4],  Backward  Explicit  Congestion  Notification  (BECN)  [5], 
Proportional  Rate  Control  Algorithm  (PRCA)  [6],  Explicit 
Rate  Feedback  (ERF)  [7]  and  Enhanced  Proportional  Rate 

Control  Algorithm  (EPRCA)  [8]. 

Under  the  feedback  based  rate  control  policy  used  in  this 
paper,  a  feedback  carrying  information  about  the  available 
bandwidth  is  transmitted  to  the  ABR  source,  every  time 
that,  the  available  bandwidth  for  the  ABR  source  changes. 
If  the  propagation  delay  between  the  ABR  source  and  the 
network  access  node  is  non-negligible,  then  the  ABR  source 
will  transmit  at  a  rate  based  on  the  most  recent  level  of 
available  bandwidth  until  it  receives  the  feedback.  For  ex¬ 
ample,  if  the  propagation  delay  between  the  ABR  source 
and  the  network  access  node  is  Td  and  the  available  band¬ 
width  changes  at  time  the  ABR  source  will  learn  about 
the  change  in  bandwidth  availability  at  time  t  +  T<*.  The 
adjusted  rate  from  the  ABR  source  will  reach  the  network 
access  node  at  time  t  4*  2 Td,  assuming  that  the  backward 
and  forward  propagation  delays  are  equal.  Therefore,  for  a 
roundtrip  propagation  delay  of  2Td  a  bandwidth  mismatch 
will  occur.  During  a  bandwidth  mismatch  the  capacity  of 
the  link  will  either  be  exceeded  (overutilization)  or  will  be 
used  inefficiently  (underutilization).  Since  ABR  cell  losses 
occur  during  overutilization  periods  of  bandwidth  mismatch 
cycles  and  the  magnitude  of  bandwidth  mismatch  is  propor¬ 
tional  to  the  propagation  delay,  the  effectiveness  of  a  rate 
based  flow  control  scheme  decreases  with  increased  propaga¬ 
tion  delay.  The  impact  of  increased  propagation  delay  on  the 
flow  control  schemes  has  been  studied  in  the  past  [9],  [10], 
[11],  [12].  Besides  the  physical  distance  other  system  param¬ 
eters  namely  the  network,  ABR  and  VBR  time  scales  impact 
considerably  on  the  effectiveness  of  the  flow  control  schemes. 
In  this  paper,  the  impact  of  these  system  parameters  on  the 
effectiveness  of  the  feedback  based  flow  control  is  studied. 

The  network  time  scale  is  defined  as  the  transmission  time 
of  a  cell  and  it  decreases  as  the  network  speed  increases. 
Therefore,  decreased  network  time  scale  corresponds  to  in¬ 
creased  propagation  delay.  The  VBR  time  scale  is  defined  to 
be  a  measure  of  the  rate  of  change  of  the  VBR  source  rate 
and  the  ABR  time  scale  is  defined  as  the  minimum  distance 
among  consecutive  blocks  of  consecutive  cells  generated  by 
the  ABR  source. 


The  remainder  of  the  paper  is  organized  as  follows.  In 
section  2,  the  system  considered  in  order  to  evaluate  the 
impact  of  time  scales  is  described.  In  section  3,  a  suitable 
Markov  Chain  for  the  system  is  formulated.  Some  numerical 
results  are  presented  in  section  4  and  the  paper  ends  with 
some  conclusions  presented  in  section  5. 


2  Description  of  the  System 

In  order  to  simplify  the  analysis  and  facilitate  the  under¬ 
standing  of  the  impact  of  the  various  time  scales,  a  sys¬ 
tem  with  one  VBR  source  and  one  ABR  source  is  consid¬ 
ered  where  each  of  these  sources  can  deliver  at  most  one  cell 
per  slot.  The  propagation  delay  between  the  ABR  source 
and  the  network  access  node  is  assumed  to  be  non-negligible. 
The  ABR  source  rates  are  calculated  at  the  network  access 
node  by  using  the  current  VBR  source  rate.  The  calculated 
transmission  rates  are  fed  back  to  the  ABR  source  when  a 
VBR  source  rate  change  occurs.  Thus,  the  network  access 
node  is  supposed  to  be  able  to  detect  a  change  in  the  VBR 
source  rate.  This  may  be  possible  through  an  explicit  indica¬ 
tion  the  VBR  source  carries  by  certain  cells  or  an  estimation 
mechanism  implemented  at  the  network  access  node. 

The  network  access  node  is  assumed  to  have  a  finite  buffer 
of  capacity  C  for  the  temporary  storage  of  the  ABR  and  VBR 
cells  and  the  cell  departures  are  assumed  to  occur  before  cell 
arrivals  occurring  over  the  same  slot. 

2.1  The  ABR  Traffic  Source 

As  indicated  previously,  the  ABR  source  rate  is  calculated 
based  on  the  current  VBR  source  rate.  If  the  current  VBR 
source  rate  is  equal  to  rt, .  then  the  maximum  allowable  ABR 
source  rate  will  be  equal  to  1  —rv  —  c.  Here,  e  is  an  arbitrarily 
small  positive  number  to  ensure  that  the  maximum  system 
load  is  below  1  (stability  condition).  It  is  assumed  that  the 
ABR  source  can  always  transmit  with  its  maximum  allowable 
rate  (heavy  traffic  assumption).  With  this  assumption,  the 
worst  case  performance  for  the  ABR  source  will  be  derived. 

In  order  to  satisfy  its  maximum  allowable  rate-,  the  ABR 
source  can  transmit  one  cell  per  Tk  slots  where  Tk  = 
[~y— d  —"| ,  ln  this  paper,  Tk  will  be  referred  to  as  the  funda¬ 
mental  ABR  subframe.  Note  that,  the  same  rate  can  also  be 
achieved,  if  the  ABR  source  transmits  a  batch  with  Bk  cells 
per  subframe  of  length  BkTk  (Bk-order  subframe). 

The  ABR  time  scale  is  the  length  of  a  R^-order  subframe 
(BkTk).  Although  the  ABR  time  scale  will  be  different  un¬ 
der  different  order  subframe  transmission  schemes,  the  same 
rate  will  be  maintained.  Figure  1  shows  examples  of  trans¬ 
mission  employing  fundamental  or  higher-order  subframes. 
Note  that,  for  a  given  subframe,  the  ABR  cells  are  trans¬ 
mitted  only  for  the  first  Bk  slots.  In  the  rest  of  the  paper, 
a  fixed  batch  size  (Bk)  will  be  chosen  for  all  possible  ABR 
source  rates  and  subframes.  Thus,  a  static  ABR  time  scale 
approach  will  be  considered. 


Figure  1:  The  ABR  traffic  subframes  for  a  transmission  rate 
of  1  /Tk 

2.2  The  VBR  Traffic  Source 

Although  the  activity  level  of  the  VBR  traffic  source  is  not 
controllable  and  can  change  in  principle  at  any  slot,  a  change 
in  the  rate  requires  some  time  period  to  be  manifested.  In 
this  paper,  it  will  be  assumed  that  such  changes  occur  only 
at  subframe  boundaries  for  analysis  tract-ability. 

Let  {Sjt}fc>i  denote  a  2-state  underlying  Markov  chain 
with  state  space  5  =  {0, 1}.  {5jt}it>i  will  be  used  to  describe 
the  VBR  source  activity.  Let  rv(sk)  denote  the  rate  of  the 
VBR  source  at  the  kth  subframe,  Sk  E  S .  Without  loss  of 
generality,  it  is  assumed  that  r„(l)  >  rv(0).  The  VBR  cell 
arrival  process  is  modeled  as  a  Markov  Modulated  Bernoulli 
process.  The  probability  of  transmitting  1(0)  cell  in  a  slot  of 
the  kth  subframe  is  rv(sk)(  1  —  rv  ($*)). 

Let  Ps(sk)  denote  the  probability  that  Sk  changes  at  the 
end  of  the  kth  subframe  at  the  beginning  of  which  it  was  in 
state  Sk.  Ps($k)  wifi  reflect  the  average  time  over  which  VBR 
applications  maintain  a  constant  rate  (or  VBR  time  scale). 
If  p5(0)  =  p5(l)  =  then  the  VBR  time  scale  will  be  equal 
to  1/p,;  otherwise,  two  time  scales  may  be  defined. 

3  Analysis  of  the  System 

In  order  to  determine  the  arrival  process  at  the  network  ac¬ 
cess  node  over  the  kth  subframe,  the  network  access  node 
needs  to  know  the  current  VBR  source  rate  (rv(sjt))  and 
whether  the  current  ABR  rate  is  based  on  the  most  re¬ 
cent  feedback  sent  by  the  node  or  not  (that  is,  whether  the 
feedback’s  impact  is  not  pending  or  pending,  respectively). 
Therefore,  let  Jk  be  an  indicator  function  which  assumes  the 
value  of  1,  if  the  impact  of  a  feedback  carrying  a  VBR  source 
rate  change  is  pending.  If  the  impact  of  the  feedback  is  not 
pending,  J*  is  equal  to  0.  The  fundamental  ABR  subframe 
(thus,  the  ABR  source  rate)  will  be  completely  determined 
by  (Sfc.Jt)  and  it  is  given  by  T*  =  l~ where  0 
denotes  the  modulo  2  addition. 

A  Markov  chain  can  be  constructed  to  describe  the  evolu¬ 
tion  of  the  system  under  fixed  and  non-zero  roundtrip  prop¬ 
agation  delays.  Here,  a  simplifying  assumption  is  made  by 
assuming  that  the  roundtrip  propagation  delay  is  random 
and  geometrically  distributed  with  mean  T-,  measured  in 
terms  of  subframes. 


Although  it  is  possible  that  more  than  one  feedback  be 
pending  at  any  time,  at  most  one  pending  feedback  will  be 
considered  in  order  to  simplify  the  analysis.  This  approxima¬ 
tion  holds  true  if  maxSk  {ps(sk)}  <  pf  \  that  is,  if  the  VBR 
source  time  scale  is  much  larger  than  the  propagation  delay. 
This  is  expected  to  be  the  case  in  order  for  a  feedback  based 
rate  control  scheme  to  be  reasonably  effective. 

With  these  assumptions,  it  is  easy  to  establish  that 
the  stochastic  process  {Sk,  Jk,Qk}  embedded  at  subframe 
boundaries  is  a  Markov  chain  with  state  space  {($*,  j*,  qk)  ■ 
0  <  Sk  <  1,0  <  jk  <  1,0  <  qk  <  C}.  Here,  Qk  is  a  random 
variable  describing  the  buffer  occupancy  at  the  beginning  of 
the  kth  subframe. 

Let  P{skjkqk)sk+ijk+iqk+\)  be  the  transition  probabil¬ 
ity  that  the  Markov  chain  {S*,  Jk,  Qk}  moves  from  state 
(skjk^k)  to  state  (sjfc+i ,  jfc+i,  It  is  clear  that. 

(sk+ujk+\)  is  independent  from  the  queue  occupancy  ( qk ) 
and  thus  P(skjkqk\ Sjfc+i  can  be  expressed  as  fol¬ 

lows. 


P(8kjkqkm*  Sk+\jk+iqk+i)  =  P{skjk\ Sk+ijk-n  )P(qk+\ /  skJkqk) 

where  P(skjk<  Sk+ijk+i)  is  the  probability  of  passing  from 
(*kjk)  to  («*+],  j*+i)  and  P(qk+i/skjkqk)  is  the  probability 
of  being  at  queue  occupancy  qk+\  at  the  beginning  of  the 
next  subframe  given  the  current  subframe  parameters.  The 
derivation  of  these  probabilities  are  shown  in  detail  in  [12]. 

After  the  calculation  of  the  transition  probabilities,  the 
steady  state  probability  distribution  II(sjt,  j*,  qk)  can  be  de¬ 
rived  by  using  the  following  matrix  equations,  II  =  IIP 
and  Yh  n,  =  1  wh  ere  P  is  the  state  transition  matrix 
each  of  whose  elements  show  the  transition  probabilities 

P  ( sk  jk<lk'>Sk+\jk+\(lk+l)‘ 

In  order  to  study  the  impact  of  the  network  and  source 
time  scales,  the  ABR  cell  loss  probability  will  be  derived 
since  the  only  negotiated  QoS  parameter  for  the  ABR  ser¬ 
vice  class  is  the  Cell  Loss  Ratio  (CLR).  The  ABR  cell  loss 
probability  can  be  calculated  using  the  average  number  of 
cells  lost  and  arrived  in  a  subframe  at  the  beginning  of  which 
is  in  state  (skJk,qk)  and  taking  the  expec¬ 
tation  over  all  possible  states. 

Let.  LP  denote  the  ABR  cell  loss  probability,  and  let 
T(skJk^k)  and  JT(^Jk^k)  denote  the  number  of  cells 
lost  and  arrived  in  a  subframe  at  the  beginning  of  which 
{Sk,Jk,Qk}k>\  in  state  (skJk,qk)  respectively.  Then, 

IP  =  E{PSk^k--^-) 

M{skJk,qk) 


Let  I{sk,jk,qk;i)  (R{sk>jk,qk;i))  denote  the  number  of 
cells  lost  (arrived)  at  the  ith  slot  of  the  kth  subframe  in  state 
(sk,jk,qk)'<  1  <  i  <  BkTk .  Clearly,  R{sk,  jk,  qk\  *)  does  not 
depend  on  the  queue  occupancy  and  is  only  a  function  of  sk, 
jk  and  i.  Thus, 


R(*kJk*qk\i)  =  R(skJk\i 


A(sk) 

A(sjt )  +  1 


if  i  >  Bk 
if  i  <  Bk 


where  A(s*)  denotes  the  number  of  VBR  cell  arrivals  in  a 
slot  when  the  current  VBR  state  is  sk- 


The  number  of  lost  cells  in  a  slot  can  be  found  by  consid¬ 
ering  the  number  of  arrivals  in  the  slot,  the  buffer  occupancy 
in  the  previous  slot  and  the  buffer  capacity  C\  Therefore. 


I{skJk,qk;i 


m 

0 


if  R(skJk]i)  =  C+  1  -  +m 

otherwise 


where  qk,i-\  denotes  the  queue  occupancy  at  the  i  —  Ith  slot 
of  the  kth  subframe. 

Clearly,  the  number  of  cell  arrivals  over  a  subframe 
at  the  beginning  of  which  the  system  Markov  Process 
{■S*,-J*,Qjb}*>i  ^  in  state  (sk,  jk,  Qk)  is  the  summation  of 
R(skJk\  i)  over  all  slots  i.  Thus,  the  average  number 
of  cell  arrivals  in  a  subframe  at  the  beginning  of  which 
{Sk,  Jk,  Qk}k>i  is  in  state  (skJk,qk)  can  be  expressed  as, 


_  BkTk 

M(sk,jk,qk)  =  E{R(skJk',i)} 

1=1 

=  Bk  +  BkTkrv(sk). 

The  total  number  of  cells  lost  over  the  entire  subframe 
is  equal  to  the  summation  of  /($*,  jk,  qk\  i)  for  all  i  slots  of 
the  subframe.  Thus,  the  following  equation  is  obtained  for 

L(skJk,  qk)- 

BkTk 

L(sk,jk,qk)=  ^2  E{I(sk,jk,qk'i )} 

»=1 

After  some  manipulations,  L(sk,  jk,  qk)  can  be  shown  to 
be  equal  to, 

Bk  C  l-C+1 

L(sk,jk,qk)  =  EEE  77lP{A(Sfc)  =  C  -  l  +  TTl/qk,Sk,  Jk 

i  =  1  1  =  0  m  =  0 

x  P{qk,x- 1  =  l/qk,o  =  qk } 


Combining  L(sk,  jk,  qk)  and  M(sk,jk,qk)  the  ABR  cell 
loss  probability  can  be  expressed  as  follows. 

11  c  . 

IP  _  v-  v-  U(sk,  Jk.  gk) 

~  2^  2-^  L*  Bk  +  rv(sk)BkTk 

*k=0  Jk=0  qk-0 
Bk  C  1-C+ 1 

x  SEE  mP{A{sk )  =  C  -  l  +  m/qk,Sk,jk} 

i=l  1=0  m= 0 

x  P{qk,x-i  =  l/qk,  o  =  qk} 


4  Numerical  Results 

In  this  section,  some  numerical  results  are  presented  to  illus¬ 
trate  the  impact  of  the  network  and  source  time  scales  on  the 
performance  of  an  ABR  source  which  is  located  away  from 
the  network  access  node. 

Figure  2  illustrates  the  ABR  cell  loss  probability  as  a  func¬ 
tion  of  pj  for  ps( 0)  =  p9(  1)  =  p s  =  0.005,0.001,0.0005  and 


0.0001.  For  this  plot,  results  are  derived  for  VBR  source 
rates  rv(  1)  =  0.8  and  rv(0)  =  0.4  cells/slot  and  a  buffer  with 
capacity  C  =  40.  As  the  propagation  delay  decreases  (pf  in¬ 
creases),  the  amount  of  time  that  it  takes  for  the  ABR  source 
to  respond  to  the  feedbacks  generated  at  the  network  access 
node  decreases.  As  a  consequence,  the  amount  of  bandwidth 
mismatch  (thus  the  overutilization  periods)  decreases,  caus¬ 
ing  a  decrease  in  the  ABR  cell  loss  probability.  In  addition 
to  the  propagation  delay,  the  VBR  time  scale  (1/p*)  also  has 
a  considerable  effect  on  the  ABR  cell  loss  probability.  When 
the  VBR  time  scale  decreases  (ps  increases),  VBR  source 
rate  changes  occur  more  frequently.  By  this  way,  more  band¬ 
width  mismatch  cycles  are  initiated  resulting  in  more  ABR 
cell  losses.  Note  that  p9  <  pj  in  order  for  the  assumption  of 
no  multiple  pending  feedbacks  to  be  reasonably  accurate. 


Figure  2:  ABR  cell  loss  probability  vs.  pj  for  various  p , 


Figures  3  arid  4.  illustrate  the  ABR  cell  loss  probability 
as  a  function  of  batch  size  Bk  for  C  —  10,  15,20,25  and  30. 
The  distance  between  the  ABR  source  and  the  network  ac¬ 
cess  node  is  assumed  to  be  20000  and  200  slots  for  Figure  3 
and  Figure  4  respectively.  Since  the  VBR  source  rates  are 
assumed  to  be  rv(l)  =  0.8  and  rv(0)  =  0.4,  the  fundamental 
subframe  lengths  are  2  and  6  slots.  Thus,  a  Bjt-order  sub- 
frame  (ABR  time  scale)  has  lengths  of  2 Bk  and  6Bk  slots. 
For  a  fixed  value  of  C,  it  can  be  observed  that,  the  ABR 
cell  loss  probability  initially  decreases  as  Bk  (the  ABR  time 
scale)  increases.  This  behavior  is  reversed  when  Bk  exceeds 
a  threshold.  Thus,  for  a  given  C,  there  is  an  optimal  Bk  (or 
ABR  timescale)  that  minimizes  the  induced  ABR  cell  losses. 
The  optimal  Bk  also  depends  on  the  propagation  delay.  For 
example,  the  optimal  Bk  for  C  =  15  is  equal  to  8  in  Figure 
3  while  it  is  equal  to  2  in  Figure  4.  This  trend  may  be  at¬ 
tributed  to  the  decreasing  positive  impact  of  a  large  ABR 
time  scale  for  a  decreased  propagation  delay. 

In  addition  to  the  network  and  source  time  scales,  the 
VBR  source  rates  also  have  considerable  effects  on  the  per¬ 
formance.  Figure  5  shows  the  effects  of  the  VBR  source 
rates  on  the  ABR  cell  loss  probability.  For  a  fair  compar¬ 


Figure  3:  ABR  cell  loss  probability  vs.  Bk  for  various  values 
of  C  for  an  average  distance  of  20000  slots 


Figure  4:  ABR  cell  loss  probability  vs.  Bk  for  various  values 
of  C  for  an  average  distance  of  200  slots 


ison,  the  mean  VBR  arrival  rate  is  kept  constant  for  the 
cases  considered.  The  VBR  source  rates  for  the  first  case 
are  rv(l)  =  0.9  and  rv( 0)  =  0.3  cells/slot,  while  for  the 
second  case  rv(l)  =  0.8  and  rv(0)  =  0.4  cells/slot.  It  can 
be  clearly  observed  that  the  loss  performance  improves  as 
the  difference  between  the  two  rates  decreases.  For  the  case 
with  r„(0)  =  0.3  and  rv(l)  =  0.9,  the  fundamental  subframe 
lengths  are  2  and  11  slots.  These  subframe  lengths  corre¬ 
spond  to  ABR  source  rates  of  0.5  and  0.091  respectively. 
Therefore,  during  a  bandwidth  mismatch  initiated  by  a  VBR 
source  rate  change,  the  total  rate  on  the  link  can  increase  up 
to  1.4  causing  potentially  high  losses.  However,  when  VBR 
rates  are  rv( 0)  =  0.4  and  rv(l)  =  0.8,  the  corresponding 
ABR  rates  will  be  0.5  and  0.167.  During  a  bandwidth  mis¬ 
match,  the  total  rate  on  the  link  will  be  at  most  1.3.  Com¬ 
pared  to  1.4,  the  overutilization  in  this  case  is  decreased. 
Therefore,  a  decrease  in  the  ABR  loss  probabilities  occurs. 


Figure  5:  ABR  cell  loss  probability  vs.  pf  for  various  VBR 
arrival  rates 


5  Conclusions 

In  this  paper,  the  impact  of  network  and  source  time  scales 
on  the  performance  of  ABR  applications  is  studied.  The 
network  time  scale  is  defined  as  the  transmission  time  of  a 
cell.  The  VBR  time  scale  is  defined  in  terms  of  the  rate 
of  change  of  the  VBR  source  rate.  The  ABR  time  scale  is 
defined  as  the  minimum  distance  between  consecutive  blocks 
of  consecutive  cells  generated  by  the  ABR  source.  In  order 
to  evaluate  the  impact  of  these  time  scales,  a  system  with 
one  ABR  and  one  VBR  source  is  considered. 

While  the  impact  of  the  network  time  scale  has  been  con¬ 
sidered  in  detail  in  the  past,  that  of  other  relevant  time  scales 
has  not  been  investigated.  The  main  contribution  of  this  pa¬ 
per  is  the  study  of  the  impact  of  the  VBR  and  ABR  time 
scales  on  feedback  based  flow  control. 

Tiie  increased  VBR  time  scale  has  a  positive  impact  on  the 
ABR  loss  performance.  This  is  due  to  the  fact  that,  less  fre¬ 
quent  VBR  rate  changes  (or  bandwidth  mismatch  cycles)  oc¬ 
cur  when  the  VBR  time  scale  increases,  resulting  in  reduced 
ABR  cell  losses.  Increasing  the  ABR  timescale  (or  the  batch 
size  Bk)'  initially  decreases  the  loss  probability.  But,  after 
the  batch  size  exceeds  a  threshold,  the  situation  is  reversed. 
The  optimal  batch  size  -defined  to  be  the  one  corresponding 
to  this  threshold-  increases  with  the  buffer  capacity,  as  the 
capability  of  absorbing  larger  batches  by  the  buffer  increases 
with  increased  buffer  space.  For  large  propagation  distances, 
the  optimal  batch  size  is  large.  As  the  propagation  distance 
decreases,  the  optimal  batch  size  decreases  as  well.  This  may 
be  due  to  the  reduced  positive  impact  of  the  ABR  time  scale 
for  decreased  propagation  delay. 

The  major  conclusion  of  this  work  is  that  in  addition  to 
the  network  time  scale,  the  ABR  and  VBR  time  scales  may 
impact  substantially  a  feedback  based  flow  control  as  well. 
Thus  the  impact  of  all  system  time  scales  should  be  consid¬ 
ered  in  order  to  accurately  evaluate  the  effectiveness  of  a  flow 
control  algorithm. 
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Abstract 

The  Quality  of  Service  (QoS)  of  real  time  applica¬ 
tions  is  affected  in  a  rather  direct  manner  by  the  timely 
delivery  of  application-level  data  units  which  are  typ¬ 
ically  mapped  into  several  ATM  cells.  As  a  conse¬ 
quence,  it  would  be  reasonable  to  associate  a  common 
deadline  with  all  the  cells  in  one  data  unit  (batch)  and 
develop  a  scheduling  policy  that  aims  to  maximise  the 
number  of  data  unit  that  are  delivered  by  their  respec¬ 
tive  deadlines. 

.4  batch-based  Earliest  Due  First  scheduling  policy 
is  proposed  and  analysed  in  this  paper  under  periodic 
batch  arrival  processes  which  are  typical  of  continuous 
media  streams.  Depending  on  whether  partial  batch 
service  is  performed  or  not,  two  variations  of  this 
scheduler  are  considered.  It  is  shown  that  the  pro¬ 
posed  policies  can  be  easily  implemented  in  terms  of  a 
First  Batch  In  -  First  Batch  Out  (FBl-FBO)  and  a 
Partial  FBl-FBO  schedulers  respectively.  Numerical 
results  are  presented  and  the  induced  batch  and  cell 
loss  rates  are  compared  against  those  induced  by  the 
standard  FIFO  scheduler  as  well  as  a  FIFO  scheduler 
which  drops  expired  cells. 

1  Introduction 

Asynchronous  Transfer  Mode  (ATM)  technology  is 
expected  to  support  applications  with  a  wide  range  of 
service  requirements  and  traffic  characteristics.  Real 
time  applications,  such  as  voice  and  video  are  par¬ 
ticularly  demanding  since  such  applications  require 
stringent  guarantees  on  the  Quality-of-Service  (QoS) 
in  terms  of  bounded  delay,  loss  and  jitter.  The  traffic 
characteristics  of  real  time  applications  can  be  very 
diverse  as  they  combine  a  variety  of  media  data  such 
as  text,  audio,  graphics,  images  and  full  motion  video. 
The  transmission  of  continuous  media  traffic  is  an  in¬ 
herently  difficult  problem  due  to  the  time  sensitive 
nature  and  the  traffic  variability  of  these  applications. 

[i][6][s] 

Consider,  for  instance,  the  transmission  of  video  ap¬ 
plications  in  which  it  is  required  that  frames  (images) 
are  transmitted  at  a  certain  rate  in  order  to  guarantee 
no  starvation  or  overflow  at  the  receiving  end.  Each 
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frame  would  be  of  variable  size  fin  bits)  according  to 
its  complexity  and  the  coding  scheme  employed.  The 
frame  could  be  considered  as  an  application  data  unit 
which  is  central  to  both  the  traffic  generation  process 
at  the  transmitter  and  the  reconstruction  process  at 
the  receiver.  Such  a  role  may  be  played  by  a  smaller 
encoding  unit  than  the  frame  such  as  a  rnacroblock. 
[•] 

Application  data  units,  such  as  the  frame,  are  or¬ 
ganised  into  small  fixed  size  and  standardised  packets, 
the  cells,  in  order  to  be  transported  over  an  ATM  net¬ 
work.  Typically,  an  application  data  unit  is  mapped 
into  several  ATM  cells.  When  network  resources  are 
reserved  at  peak  demand,  all  cells  of  the  application 
data  unit  will  be  delivered  on  time,  leading  to  a  timely 
and  error-free  reconstruction  at  the  receiver.  When 
resources  are  over-allocated  to  a  number  of  applica¬ 
tions  -to  increase  network  utilisation-  all  cells  of  an 
application  data  unit  may  not  be  delivered  before  this 
application  data  unit  is  required  by  the  reconstruction 
process.  In  this  case,  application  data  units  may  be 
significantly  distorted  due  to  missing  cells  or  be  im¬ 
possible  to  process  at  all.  The  required  QoS  for  such 
applications  may  then  be  defined  in  terms  of  a  metric 
associated  with  the  correctly  and  timely  delivered  ap¬ 
plication  data  units.  An  application-level  QoS  metric 
such  as  the  application  data  unit  loss  rate  would  de¬ 
scribe  more  accurately  the  QoS  delivered  to  the  end 
user,  than  a  network-level  QoS  metric  such  as  the  cell 
loss  rate.  [2] 

In  order  to  achieve  QoS  delivery  to  real  time  ap¬ 
plications  while  efficiently  utilising  the  network  re¬ 
sources  it  is  necessary  that  allocation  of  the  resources 
(scheduling)  be  based  on  the  applications’  QoS  re¬ 
quirement.  For  instance,  the  timely  delivery  of  infor¬ 
mation  associated  with  real  time  applications  suggests 
that  deadlines  could  be  defined  and  deadline-driven 
scheduling  policies  be  designed.  If  the  nature  of  an 
application  requires  that  a  complete  application  data 
unit  should  be  received  before  it  can  be  processed  by 
the  receiver,  then  it  would  be  reasonable  to  associate  a 
common  deadline  with  all  the  cells  in  the  same  data 
unit  and  develop  scheduling  policies  that  aim  to  max¬ 
imise  the  number  of  data  unit  that  are  delivered  by 
their  respective  due  date.  This  approach  is  different 
from  many  traditional  approaches  in  which  every  cell 
is  assigned  its  own  deadline  and  the  scheduling  policy 


typically  attempts  to  maximise  the  number  of  individ¬ 
ual  cells  transmitted  by  their  due  date. 

In  this  paper  a  scheduling  policy  which  would  be 
appropriate  for  supporting  applications  such  as  those 
described  above  is  introduced.  This  policy  attempts 
to  maximise  the  number  of  application  data  units  - 
referred  to  as  batches  in  the  rest  of  the  paper-  served 
before  their  deadlines. 

Unlike  the  typical  cell-based  Earliest  Due  First 
(EDF)  scheduling  policy  which  requires  sorting  on  a 
cell-by-cell  basis  [4],  the  proposed  policy  requires  such 
sorting  on  a  batch-by-batch  basis,  reducing  the  pro¬ 
cessing  burden.  For  continuous  media  applications  in 
which  batches  (application  data  units)  are  delivered 
over  well  defined  periods  the  processing  burden  of  the 
proposed  policy  can  be  substantially  reduced  by  or¬ 
dering  the  multiplexed  data  streams  according  to  their 
respective  deadlines  and  serving  them  according  to  the 
First  Batch  In  -  First  Batch  Out  (FBI-FBO)  policy, 
as  it  will  become  clear  in  later  sections.  The  complex¬ 
ity  of  this  policy  is  similar  to  that  of  a  (cell-based) 
FIFO  policy  applied  to  the  deadline  ordered  streams. 
The  performance  of  the  FBI-FBO  and  the  FIFO  poli¬ 
cies  is  evaluated  in  terms  of  the  induced  ceil  loss  and 
batch  loss  probabilities.  In  view  of  the  earlier  discus¬ 
sion  the  batch  loss  probability  may  be  a  more  relevant 
metric  for  applications  in  which  higher  layer  protocol 
data  units  are  fragmented  into  a  number  of  ATM  cells; 
these  data  units  may  be  considered  to  be  lost  if  any 
one  of  their  cells  are  lost,  unless  some  redundancy  is 
introduced  in  the  data  unit  to  improve  the  probability 
of  correct  reception  at  the  expense  of  increasing  the 
traffic  in  the  network. 

2  System  Description 

This  section  describes  the  system  and  the  traffic 
environment  considered  in  this  work.  A  network  node 
with  N  incoming  and  one  outgoing  links  is  considered. 
All  links  are  assumed  to  have  the  same  capacity  and, 
thus,  a  cell  requires  a  fixed  amount  of  time  for  trans¬ 
mission,  referred  to  as  a  slot.  All  arrivals  and  de¬ 
partures  from  the  node  occur  at  slot  boundaries.  Let 
J  denote  an  incoming  data  stream  (application);  J 
takes  values  in  the  range  0  <  j  <  N  —  1.  A  vari¬ 
able  with  superscript  j  represents  a  quantity  associ¬ 
ated  with  stream  j . 

The  incoming  traffic  is  assumed  to  be  bursty  peri¬ 
odic.  It  consists  of  variable  size  bursts  of  cells  arriving 
at  fixed  time  intervals  (periods);  all  applications  are 
assumed  to  have  the  same  period,  T.  An  example  of 
a  batch  arrival  pattern  can  be  seen  in  Figure  1.  This 
type  of  traffic  is  characteristic  of  many  continuous  me¬ 
dia  applications  such  as  digital  audio  and  video.  It  is 
further  assumed  that  the  bursts  of  stream  j  arrive  over 
consecutive  time  slots,  that  is  in  the  form  of  batches, 
and  that  their  length,  A7,  follows  a  general  distribu¬ 
tion  with  expected  value  A7.  All  batches  of  the  same 
application  j  are  assumed  to  have  the  same  deadline, 
DJ ,  which  is  provided  to  the  server  upon  connection 
set  up.  The  deadline  determines  the  maximum  time 
(measured  in  cell  slots)  which  the  batch  may  remain 
in  the  node  once  the  header  of  the  batch  has  arrived. 


This  includes  both  the  delay  and  the  transmission  de¬ 
lay. 

Figure  2  depicts  the  instants  at  which  batch  arrivals 
occur  for  N  =  4  streams.  Let  {tk}k>o  denote  the  se¬ 
quence  of  batch  arrival  instants  (from  any  stream); 
indicates  the  arrival  instance  of  the  kth  batch.  Vari¬ 
ables  with  subscript  k  will  represent  the  value  of  the 
associated  quantity  at  time  f*.  Let  r*  denote  the  in¬ 
terval  between  two  consecutive  batch  arrivals  (from 
any  stream);  that  is  : 

Tfc  =  *jb+i-<*  with  (1) 

r*  =  r;  if  J k  =  j,  where  r}  denotes  the  in¬ 
terarrival  time  between  two  consecutive  batches  from 
streams  j  and  (J  +  1  )mod(X),  or  equivalently,  the  dis¬ 
tance  between  two  consecutive  batch  arrival  instants 
of  streams  j  and  ( j  +  l)rnod(N).  In  the  rest  of  this 
paper  it  is  assumed  that  the  streams  are  ordered  in 
terms  of  increasing  deadlines.  That  is,  D*7  <  for 

0  <  j  <  N  -  2. 

As  it  was  stated  earlier,  the  goal  of  the  proposed 
policy  is  the  timely  delivery  of  complete  batches.  This 
goal  suggests  that  the  scheduler  should  serve  batches 
based  on  the  Earliest  Due  First  (EDF)  policy.  In 
view  of  the  deadline  ordering  and  the  periodicity  of 
the  batch  arrival  process  it  is  easy  to  establish  that 
a  scheduler  that  serves  batches  according  to  the  First 
In  First  Out  policy  does  serve  them  according  to  the 
EDF  policy.  This  proposed  policy  is  referred  to  as  the 
First  Batch  In  -  First  Batch  Out  (FBI-FBO)  policy 
and  is  discussed  in  more  detail  below. 

2.1  The  FBI-FBO  Scheduling  Policy 

Implementation  of  this  policy  requires  that  infor¬ 
mation  regarding  the  length  (in  cells')  of  a  batch  be 
available  to  the  scheduler  upon  batch  arrival.  This 
will  be  the  case  if  the  batch  size  is  fixed  and  provided 
to  the  scheduler  upon  connection  set  up  or  if  this  in¬ 
formation  is  carried  by  the  first  cell  of  the  batch  or  by 
dedicated  cells  such  as  Resource  Management  Cells. 

Upon  Batch  arrival  the  scheduler  decides  whether 
this  batch  can  be  served  by  its  deadline  or  not.  In  the 
former  case,  the  batch  is  served  before  its  deadline 
expires  as  described  below.  In  the  latter  case,  the 
entire  batch  is  discarded. 

The  scheduler  is  equipped  with  a  service  queue  to 
which  cells  are  shifted  for  storage  before  they  are 
transmitted.  All  cells  shifted  to  the  service  queue  will 
be  transmitted,  following  the  first  cell  in  -  first  cell 
out  policy.  Suppose  that  a  batch  from  stream  j  ar¬ 
rives  at  time  £*.  If  the  sum  of  the  batch  length  A7 
and  the  service  queue  occupancy  Q  does  not  exceed 
the  batch  deadline,  LP ,  a  reservation  is  made  in  the 
service  queue  for  a  space  of  A  slots.  The  service  queue 
occupancy  is  then  registered  as  being  ( Q  +  A7),  even 
though  only  the  first  cell  has  arrived  so  far.  The  re¬ 
maining  (A7  —  1)  cells  of  the  batch  will  be  arriving  over 
the  next  (A7  —  1)  time  slots  and  they  will  be  shifted 
to  the  reserved  space  in  the  service  queue  immediately 
upon  arrival.  In  other  words,  data  stream  j  ’sees'  a 
logical  service  queue  of  capacity  D*7  and  decides  to  ini¬ 
tiate  a  batch  shifting  to  the  service  queue  only  if  the 
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Figure  1:  An  example  batch  arrival  pattern. 


following  condition  is  met  upon  arrival: 

Q  +  Aj  <  &  (2) 

This  condition  guarantees  that  the  cells  will  be  sent  to 
the  output  link  within  D*  time  slots  and,  thus,  they 
will  meet  their  deadline.  If  the  associated  logical  queue 
overflows,  then  the  dropped  cells  will  be  the  ones  to 
miss  the  batch  deadline.  In  this  case  no  cell  of  the 
batch  is  shifted  to  the  service  queue. 

It  should  be  noted  that  the  server  attends  to  a 
newly  arrived  batch  once  upon  the  arrival  of  the 
header,  determines  whether  it  will  meet  its  deadline  or 
not  and  decides  whether  to  serve  it  or  drop  it.  Once  a 
scheduling  decision  is  made,  it  is  no  longer  necessary 
to  keep  a  record  of  the  batch  arrival,  its  length  or  its 
relative  deadline.  The  above  scheduling  policy  is  work 
conserving  in  the  sense  that  the  server  will  not  remain 
idle  as  long  as  there  are  batches  waiting  to  be  served. 
If  the  service  queue  becomes  empty,  the  server  will 
become  idle  and  will  await  for  the  next  batch  arrival. 

2.2  The  Partial  FBI-FBO  Scheduling  Pol¬ 
icy 

In  an  environment  in  which  complete  batches  need 
to  be  transmitted  before  their  deadline  in  order  to 
be  useful,  a  scheduling  policy  that  drops  expired  cells 
is  more  effective  than  the  one  which  serves  expired 
cells,  in  the  sense  that  a  larger  fraction  of  batches  can 
be  transmitted  before  their  deadlines.  In  order  for 
the  scheduler  to  determine  whether  an  entire  batch 
will  meet  its  deadline  or  not,  information  about  the 
length  of  the  batch  must  be  provided  upon  batch  ar¬ 
rival.  This  is  the  assumption  made  in  the  FBI-FBO 
policy  described  above. 

If  the  batch  length  information  is  not  provided  upon 
batch  arrival  the  scheduler  would  attend  to  each  cell  of 
the  batch  individually  by  checking  the  service  queue 
occupancy  against  the  remaining  time  till  the  dead¬ 
line  expiration  on  a  cell-by-cell  basis.  In  this  case, 
though,  the  scheduler  may  serve  part  of  a  batch  and 
drop  the  remaining  cells  which  will  not  meet  the  dead¬ 
line.  The  entire  batch  would  then  be  lost.  The  vari¬ 
ation  of  the  FBI-FBO  policy  -referred  to  as  Partial 
FBI-FBO  policy-  is  considered  in  this  paper  as  well. 
This  is  similar  to  the  FBI-FBO  policy  with  the  differ¬ 
ence  that  the  server  serves  the  fraction  of  the  batch 


that  can  meet  its  deadline.  Clearly,  a  smaller  num¬ 
ber  of  complete  batches  and  a  higher  number  of  cells 
would  be  served  under  the  Partial  FBI-FBO  policy. 

3  Performance  Analysis 

In  this  section  a  more  precise  notation  is  introduced 
and  the  derivation  from  first  principles  of  the  equa¬ 
tions  that  mathematically  describe  the  system  is  pre¬ 
sented. 

Let  Jk  denote  the  data  stream  from  which  a  new 
batch  arrival  occurs  at  time  f*.  The  content  of  the 
new  arrival  is  to  be  shifted  to  the  service  queue  be¬ 
ginning  at  time  tk .  Let  Qk  denote  the  service  queue 
occupancy  at  time  tk ,  after  it  has  been  updated  by 
the  number  of  cells  in  the  new  batch  from  stream  Jk- 
In  view  of  the  above,  the  system  process  {Qt,^}jb>o 
can  be  defined  as  a  2-D  process  with  state  space 
^  V*  Dmax  1 0  5:  Jk  5:  N  —  1}.  Dmax 
denotes  the  maximum  deadline  of  all  N  stream  dead¬ 
lines;  that  is  Dmar  =  max;  { iy } .  Note  that  for 
Jk  the  only  transitions  allowed  are  from  some  jk  to 
jk + 1  =  {jk  +  l)mod(N).  Based  on  the  above  defini¬ 
tion  of  the  system  process,  it  is  easy  to  establish  that 
it  is  a  Markov  chain.  This  can  be  established  by  not¬ 
ing  that  J k  is  a  periodic  random  variable  and  that  the 
next  service  queue  occupancy,  Qk+\,  can  be  proba¬ 
bilistically  determined  from  the  current  one,  Q*,  and 
the  new  batch  arrival,  A*k  +  l;  A;k  denotes  the  random 
variable  representing  the  number  of  cells  from  stream 
jk  that  arrived  at  time  l*-  The  remaining  of  this  sec¬ 
tion  contains  the  derivation  of  the  probability  that  the 
Markov  chain  moves  from  any  state  ( qk,jk )  to  state 
(7*+i  Jifc+i),  given  by  P(qk}  jk,  f*+i»  jit+i). 

3.1  FBI-FBO  Policy 

Under  this  policy,  cells  of  a  batch  are  served  only  if 
the  entire  batch  can  meet  its  deadline.  Three  distinct 
cases  can  be  identified,  based  on  the  queue  occupancy 
of  the  service  buffer  at  the  instant  of  a  new  batch  ar¬ 
rival.  In  each  of  the  following  cases  0  <  jk  <  N  —  1 
and  jjb+i  =  (jk  +  1  )mod(N). 

Case  A:  for  0  <  qk  <  Tk.  In  this  case  all  the  cells 
in  the  buffer  will  have  been  served  by  the  time  the 
next  batch  arrives  from  stream  jjt+i  at  time  4+1  and 
the  new  buffer  occupancy  will  be  equal  to  the  number 


Stream  0 


k 

k  +  4 

k  +  8 

T 

k  +  I 

k  +  5 

k  +  9 

Stream  2 


k  +  2 


k  +  6 


k  +  !0 


Stream  3 


k-  1 


k  +  3 


k  +  7 


t  k-4  k-3  k-2  k-1  k  k+1  k+2  k+3  k+4  k+5  k+6  k+7  k+8  time 


Figure  2:  Periodic  arrival  pattern  with  shifted  relative  arrival  instants  for  N  =  4  traffic  streams. 


of  new  arrivals  only: 

P(<lk,jk,qk+l,jk+\)  =  P{A’k+'  =  <7*  +  l} 

for  0  <  qk+ 1  <  T  (3) 

Case  B:  for  r*  <  qk  <  Z>'l‘+I  +  r*  —  1.  In  this  case 
by  the  time  the  next  batch  arrives  the  service  buffer 
occupancy  will  have  dropped  to  (<7* —  7*).  Hence,  with 
the  new  arrival  A*k+l  the  occupancy  will  be  updated 
to  (qk  —  rk  4*  A ;fc  +  ‘)  if  the  entire  batch  can  meet  its 
deadline  DJit+* : 

P(qk,jk,qk+i,jk+\)  =  PM’""1'1  =  qk+i  -  (qk  -  n)} 

for  (qt  -  Tk)  <  qk+i  <  Djk+'  (4) 

Otherwise,  the  new  batch  will  be  dropped  and  the 
service  buffer  occupancy  will  become  (<7*  —  7>): 

T 

P(qk,jk,qk+i,jk+i)  =  P{AJk+'  =  n} 

n  =  Difc  +  l  -(qk  -rfc)+ 1 

for  qk  +  x  =  qk  -  Tk  (5) 

Case  C:  for  DJfc+1  +rjk  <  qk  <  Drnax *  Hi  this  last  case 
the  new  batch  will  definitely  miss  its  deadline  and  thus 
it  is  dropped  with  probability  1: 

P(<!kJk,qi:+ujk+\)  =  1  for  ?jt  +  i  =  qk  -  n  (6) 

3.2  Partial  FBI-FBO  Policy 

This  policy  allows  for  a  batch  to  be  either  partially 
or  fully  served.  In  the  event  where  only  a  fraction  of 
the  batch  can  meet  its  deadline  those  cells  are  shifted 
to  the  service  buffer  and  the  remaining  are  dropped. 


Again  three  distinct  cases  can  be  identified.  Cases  A 
and  C  are  the  same  as  in  the  above  section. 

Case  B:  for  rk  <  qk  <  Djk+X  +  Tk  —  1.  In  this  case 
depending  on  the  length  of  the  new  batch  there  are 
two  distinct  possibilities  of  full  or  partial  service: 

Full  service: 

P{qk,jk,qk  +  l,jk+l)  =  P{AJk  +  '  =  qk  + 1  -  (qk  -  V*)} 

for  (qk  -  Tk)  <  qk  +  l  <  iyk *'  ~  1  (") 

Partial  service: 

T 

P(qk,jk,qk+i,jk+i)  =  P  {A'‘+1  =  n} 

n  =  Difc  +  l  -{qk-Tk) 

for  qk+l  =  (8) 

4  Computation  of  Batch  Loss  and  Cell 
Loss  probabilities 

Apart  from  the  cell  loss  probability  that  can  be 
used  in  order  to  assess  the  performance  of  the  pro¬ 
posed  policies,  another  meaningful  metric  would  be 
the  batch  loss  probability.  A  batch  is  considered  to  be 
lost  unless  all  its  cells  are  served.  The  batch  or  cell 
loss  probabilities  are  easily  computed  by  involving  the 
Markov  system  process. 

4.1  Batch  Loss  Probability 

Since  a  batch  is  considered  lost  if  one  of  its  cells  is 
lost,  the  analysis  for  the  batch  loss  probability  is  the 
same  for  both  policies. 

Let  Lu(qk,jk  +  \)  denotes  the  number  of  batches 
that  are  dropped  over  the  interval  [£*+1,4+2),  where 
j*  + 1  is  the  data  stream  the  scheduler  will  attend  to 


over  this  interval  (following  stream  jk)  and  qk  is  the 
queue  occupancy  at  t*.  Clearly: 

^  jk+\)  =  1{4>*  +  1  >£>Jic+i  (^) 

The  conditional  expected  value  of  this  indicator 
function  given  that  the  system  is  in  state  (qk,j ji*+i) 
is  given  by: 


LB(qk,jk+i)  =  E{1 

{.4^+*  >D’*+i -(?k-Tfc)}  }  (io) 

It  represents  the  average  number  of  batches  lost  as  a 
function  of  the  queue  occupancy  at  the  time  instant  tk 
and  the  data  stream  arrival  that  occurs  at  \  under 
both  policies. 

If  0  <  qk  <  r*  then  clearly  there  will  be  no  losses 
since  the  server  will  serve  all  the  cells  in  the  service 
buffer  during  the  interarrival  time  r*  between  streams 
jk  and  jjb+i  and  thus: 

L^(qk,jk+ 0  =  o  (li) 

LB(qk,jk+ 0  =  0  (12) 

If  <  gjt  <  Dmai  then  losses  may  occur.  The  con¬ 
ditional  expected  value  of  those  losses  in  any  state 
(qk.jk+i)  is  given  by: 


r 

L“(qk,jk+l)  =  P{A»+'=n} 

n  =  D,k  + »  -($k-T*)+  1 

(13) 

Hence,  the  overall  average  value  of  the  batch  losses  LB 
over  any  interval  r *  will  be: 

Z*  = 

A'-l  Om.r  T 

]C  H  P{i4;k+1  =  n}ir(qk,jk), 

J= 0  ?ic=rfc  +  l  n  =  DJ‘  +  i  _(?fc_Tk)+i 

(14) 

where  tt( ,  jk)  is  the  steady  state  probability  distri¬ 
bution  of  the  corresponding  Markov  Chain. 

Finally,  the  batch  loss  probability  is  given  by: 


Pb  = 


Ejbatches  lost  over  any  interval  r*}  _  LB  -j-g 

E{ batches  arrived  over  anv  interval  }  I 

(15) 

The  denominator  is  equal  to  1  since  by  definition  of 
the  system,  there  is  only  one  batch  arrival  over  any 
interarrival  interval  7*. 

4.2  Cell  Loss  Probability 

The  derivation  of  the  cell  loss  probabilities  is  similar 
for  both  policies  and  follows  the  same  procedure  as  in 
the  above  section.  Under  the  FBI-FBO  policy,  the 
cell  losses  correspond  to  the  cells  of  the  batches  which 
did  not  meet  the  deadline.  Whereas  under  the  Partial 


FBI-FBO  policy  the  lost  cells  can  be  a  fraction  of  the 
batch’s  total  number  of  cel  s. 

Again  let  Lc(qk,  Jjt+i )  denote  the  number  of  cells 
dropped  over  the  interval  /*+i ,  **+2),  where  j*+t  is 
the  data  stream  the  scheduler  will  attend  to  over  this 
interval  and  qk  is  the  queue  occupancy  at  tk.  Also, 

let  Lc (qk,jk+\)  denote  the  conditional  expected  value 
given  that  the  system  is  in  state  (?jt,  jfc  +  i)- 

As  in  the  previous  section,  if  0  <  7*  <  7*  then 
clearly  there  will  be  no  losses  since  the  server  will  serve 
all  the  cells  in  the  service  buffer  during  the  interarrival 
time  Tk  between  streams  jk  and  jk+ 1  and  thus: 


LC((ik,jk+\)  =  0  (1G) 

Lc(qk,jk+i)  =  0  (17) 

for  both  policies. 

If  Tk  <  </jb  <  Drnar  then  losses  may  occur.  For 
the  FBI-FBO  policy  the  conditional  average  number 
of  cells  lost  over  any  interval  7*  is  given  by: 


_  r 

L\(qk,jk+i)  =  nP{^‘+l  =  n} 

n  =  DJfc  +  l  -(</*  -r*;  )+l 

(18) 

The  same  quantity  for  the  Partial  FBI-FBO  policy  is 
given  by: 


L?  (qk ,  jk+\)  — 


T 

[n-(Dj>+>-(qk-Tk))]P{Aik+'  =  n} 

r.  =  D'‘  +  i-(„-rk)+l 

(19) 

The  overall  average  values  of  L^(qk,jk+\)  and 
£2  (qk,jk+i)  >s  given  by: 


Li(qk,jk+\)  = 

n-  1  r 

H  L  E  nP{AJk+'  =  n}ir(qt,jk) 

j  =  0  ?fc=r*  +  1  n  =  D^  +  i -(<7fc-r*)+l 

(20) 


Ln(qk,jk+\)  = 

N-\  T 

£  P{A}-+'  =  n}[n-{iyk+'-(qk 

j=0  ?fc=rfc+l  n  =  DJfc+l 

(21) 

where  v(qk,jk)  is  the  steady  state  probability  distri¬ 
bution  of  the  corresponding  Markov  Chain. 

Finally,  the  cell  loss  probability  is  given  by: 


p  _  £{cells  lost  over  any  interval  7*} 

£7{ cells  arrived  over  any  interval  7*}  ^ 


The  numerator  is  given  by  Equations  (20)  and  (21). 
The  denominator  represents  the  average  number  of 
cells  that  arrive  in  the  system  over  any  interval  t*.  It 
can  be  computed  by  summing  the  mean  batch  length 
of  each  data  stream  weighted  by  the  probability  that 
a  batch  that  arrives  in  any  given  interval  r*  belongs  to 
a  particular  data  streams.  Since  all  the  data  streams 
have  the  same  frequency  of  batch  arrivals  in  a  period 
the  weighted  sum  over  any  interval  t*  is  given  by: 


5  Numerical  Results  and  Discussion 

In  this  section,  the  performance  of  the  proposed 
policies  is  compared  to  that  of  the  FIFO  policy  which 
serves  the  deadline  ordered  streams  on  a  cell-by-cell 
basis.  Comparison  is  based  on  numerical  results  for 
the  batch  loss  and  cell  probabilities  induced  by  these 
policies. 

The  induced  losses  under  the  proposed  policies  have 
been  computed  using  equations  (14),  (15),  (20),  (21), 
(22)  and  (23)  while  those  under  the  FIFO  policy  have 
been  computed  through  simulations.  For  each  simula¬ 
tion  run  10'  batch  instants  have  been  generated  from 
each  stream. 

Two  versions  of  the  FIFO  policy  have  been  con¬ 
sidered:  FIFO  policies  with  and  without  an  expired 
cell  discarding  mechanism.  The  first  one  assumes  no 
knowledge  of  the  deadlines  and  simply  serves  the  cells 
on  a  first-come  first-served  basis.  This  is  referred  to  as 
the  pure  FIFO  approach  in  this  paper  and  it  is  simple 
and  straightforward  in  its  implementation.  The  pro¬ 
posed  policy  is  compared  to  the  pure  FIFO  in  order  to 
evaluate  their  relative  performances  and  to  determine 
if  a  significant  gain  can  be  achieved  by  employing  a 
more  involved  scheduling  policy.  The  second  FIFO 
policy  that  is  considered  has  an  expired  cell  discard¬ 
ing  mechanism.  It  identifies  and  drops  those  cells  that 
can  not  meet  their  deadline.  In  order  for  the  expired 
cells  to  be  dropped  they  typically  need  to  be  searched 
for.  The  computational  overhead  in  this  case  is  quite 
significant  for  a  large  number  of  streams.  Another 
drawback  of  the  FIFO  approach  is  that  cells  from  dif¬ 
ferent  batches  are  multiplexed  at  the  service  queue. 
This  causes  the  shape  of  the  data  traffic  to  change 
and  one  information  unit  spreads  over  a  longer  time 
interval  in  the  output  link  as  compared  to  its  original 
length. 

5.1  Homogeneous  Traffic 

Initially  a  homogeneous  traffic  environment  is  con¬ 
sidered  whereby  the  traffic  from  all  data  streams  is 
statistically  identical  and  has  the  same  performance 
requirement  (that  is  common  delay  tolerance).  The 
batch  lengths  are  geometrically  distributed.  The  in¬ 
terarrival  interval  r*  is  constant  and  equal  to  r  =  -  (r 
is  always  rounded  to  the  nearest  integer).  The  system 
utilisation  can  be  varied  in  proportion  to  the  average 
batch  length  and  the  period  T. 

Figure  3  shows  the  batch  loss  probability  under  the 
FBI-FBO  policy  (referred  to  here  as  Policy  1),  the 


Partial  FBI-FBO  policy  (referred  to  here  as  Policy  2) 
and  the  FIFO  with  the  expired  cell  discarding  mecha¬ 
nism  as  a  function  of  the  system  utilisation  for  N  =  4, 
T  =  40  and  identical  deadlines  D;  =  D.  The  following 
deadlines  have  been  considered:  D  =  40  (solid  line)  , 
D  =  60  (dotted  line)  and  D  =  80  (dashed  line).  In 
other  words,  the  deadline  has  been  set  to  1,  and  2 
frames  (periods).  Note  that  as  t lie  deadlines  increase 
the  losses  decrease  as  was  expected. 

Figure  4  shows  how  the  batch  loss  probability 
changes  as  a  function  of  the  o Acred  load  for  all  four 
policies  for  N  =  4  and  T  —  40.  The  results  are  for 
deadlines  equal  to  1  and  1^  frames.  The  top  line  with 
x-marks  corresponds  to  the  FIFO  without  the  expired 
cell  discarding  mechanism,  the  line  with  stars  corre¬ 
sponds  to  the  FIFO  with  expired  cell  discarding,  the 
line  with  circles  corresponds  to  Policy  2  and  the  bot¬ 
tom  solid  line  is  Policy  1.  Among  the  four  policies, 
Policy  1  provides  the  lowest  batch  loss  rate.  For  low 
system  utilisation  it  performs  better  than  the  FIFO 
approaches  by  approximately  an  order  of  magnitude. 
As  the  system  utilisation  increases  Policy  2  and  the 
FIFO  with  the  expired  cell  discarding  mechanism  tend 
to  converge  to  a  common  batch  loss  rate  whereas  the 
losses  of  the  pure  FIFO  increase  at  a  faster  rate. 

Similar  observations  can  be  made  when  the  num¬ 
ber  of  data  streams  is  increased  to  N  =  S.  Figure 
5  shows  the  results  for  deadlines  equal  to  1  and 
frames  and  T  =  64.  Again  the  policies  at  the  two 
extreme  ends  are  Policy  I  and  the  pure  FIFO.  Policy 
1  provides  the  lowest  batch  loss  rate  among  the  four 
and  outperforms  the  pure  FIFO  by  at  least  one  order 
of  magnitude.  Policy  2  anti  the  FIFO  with  the  ex¬ 
pired  cell  discarding  mechanism  tend  to  converge  for 
high  system  utilisations. 

So  far,  only  the  batch  loss  rates  have  been  consid¬ 
ered.  Figure  6  shows  the  cell  loss  rates  for  the  cases 
considered  in  Figure  5.  Notice  that  Policy  2  (line  with 
circles)  performs  better  than  Policy  1  (lower  solid  line) 
as  expected.  However,  the  performance  of  Policy  1  is 
the  same  or  better  than  that  of  the  pure  FIFO  (upper 
line  with  x-marks). 

From  the  above  results  it  can  be  clearly  concluded 
that  Policy  1  has  the  lowest  batch  loss  rate  while  at 
the  same  time  it  keeps  the  cell  loss  rate  at  the  same 
or  lower  level  than  that  of  the  pure  FIFO  approach. 
The  performances  of  Policy  2  and  of  the  FIFO  with 
the  expired  cell  discarding  mechanism  are  generally  in 
the  middle  with  higher  batch  loss  rates  and  slightly 
lower  or  equal  cell  loss  rates  than  Policy  1. 

Increasing  the  deadline  of  the  input  traffic  to  2 
frames  or  more  (not  shown  in  this  paper)  resulted 
in  about  the  same  performance  under  all  policies.  In 
other  words,  if  the  deadlines  are  long  enough  the  losses 
become  independent  of  the  particular  scheduling  pol¬ 
icy  and  they  depend  on  the  traffic  load  alone. 

Figures  7  and  8  present  the  results  under  all  four 
policies  under  a  batch  size  distribution  delivering  a 
batch  size  of  length  L  cells  with  probability  p  and 
length  1  with  probability  1  —  p  (1  -  L  bursty  periodic 
traffic  model).  The  system  parameters  are:  N  =  8, 
T  =  64,  L  =  36,  p  =  0.2  and  deadlines  equal  to  1  and 


1  ~  frames. 

Figure  7  shows  the  batch  loss  probability  as  a  func¬ 
tion  of  the  offered  load.  Once  again  Policy  1  achieves 
the  lowest  batch  loss  rate  whereas  the  pure  FIFO  pol¬ 
icy  has  the  highest. 

The  cell  loss  probability  is  shown  in  Figure  8.  As  it 
can  be  seen,  Policy  1  induces  equal  or  lower  cell  loss 
rate  than  the  pure  FIFO  approach,  as  before.  Again 
the  losses  spread  over  mostly  three  orders  of  magni¬ 
tude  as  opposed  to  five  or  six  orders  under  geometri¬ 
cally  distributed  batches. 

5.2  Inhomogeneous  Traffic 

An  inhomogeneous  traffic  environment  has  also 
been  considered.  The  objective  in  this  case  is  to  eval¬ 
uate  the  level  of  diversity  in  the  induced  performance. 
Out  of  the  N  =  8  data  streams,  two  are  assumed  to 
deliver  probabilistically  larger  batches  and  since  the 
period  is  the  same  for  all  streams,  have  larger  load,  r 
is  maintained  constant  for  all  streams.  The  six  (back¬ 
ground)  streams  are  assumed  to  deliver  batches  of  size 
1  or  L  with  parameter  p  (as  before)  with  L  —  10  and 
p  —  0.2.  The  two  (more  active  streams)  streams  have 
parameters  L  and  p  =  0.4,  where  L  assumes  a  value 
between  8  and  56  so  that  a  specific  system  utilisation 
is  achieved. 

Figure  9  shows  the  batch  loss  rates  for  the  two  more 
active  streams  (data  streams  1  and  4)  and  the  six 
background  streams  under  all  four  policies.  It  can 
be  observed  that  Policy  1  induces  the  lowest  losses 
for  the  two  more  active  streams  which  are  of  the  or¬ 
der  of  10”2  —  10“3  and  are  observed  only  for  a  sys¬ 
tem  load  higher  than  0.7.  The  other  policies  induce 
higher  losses  for  a  wider  range  of  system  utilisation. 
The  two  FIFO  policies  induce  approximately  the  same 
losses  which  are  higher  than  those  under  Policy  1  by  at 
least  an  order  of  magnitude.  For  the  six  background 
streams  Policy  1  induces  a  loss  rate  of  the  order  of 
X 0“3  —  1  O'"4  for  system  utilisation  higher  than  0.7  only. 

To  further  reduce  the  batch  losses  an  attempt  was 
made  to  adjust  the  streams’  interarrival  time  r  in  pro¬ 
portion  to  the  expected  batch  length  of  each  stream. 
Increasing  the  values  of  r  for  streams  with  high  mean 
batch  lengt  h  could  result  in  a  reduction  of  the  average 
occupancy  of  the  service  buffer  since  ’busy’  streams 
will  now  have  more  time  to  be  served  before  the  next 
batch  arrival  occurs.  The  value  of  r  has  been  set  using 
the  following  equation: 


T* 


(24) 


and  is  rounded  to  the  nearest  integer  as  before. 

Figure  10  shows  the  batch  losses  for  Policy  1  with 
equal  values  of  r  (solid  line)  and  with  weighted  values 
of  r  (dashed  line).  It  can  be  noted  that  adjusting  the 
value  of  t  hardly  affects  the  loss  rate  for  the  two  more 
active  streams.  In  fact,  there  is  a  slight  increase.  On 
the  other  hand,  the  loss  rate  for  the  six  background  has 
been  improved  significantly.  Losses  are  of  the  order  of 
10“5  only  for  system  utilisations  higher  than  0.9. 


The  more  active  streams  have  longer  batches  as 
compared  to  other  streams.  Long  batches  will  reserve 
more  space  in  the  service  buffer  once  they  arrive  and 
are  accepted  by  the  system.  Suppose  that  a  more  ac¬ 
tive  stream  is  followed  by  a  few  less  active  ones  which 
is  followed  in  turn  by  a  more  active  one  and  so  on. 
This  is  the  case  for  the  batch  losses  in  Figure  10. 
Once  a  long  batch  is  accepted  by  the  system  it  will 
be  shifted  to  the  service  buffer.  The  longer  the  inter- 
arrival  interval  r  is  the  more  cells  will  be  served  before 
the  next  batch  arrives  thus  creating  more  free  slots  in 
the  service  buffer  for  the  next  arrivals.  This  implies 
that  fewer  batches  will  be  dropped  from  the  less  active 
streams  which  follow  (decrease  in  the  loss  rate  for  the 
less  active  streams  as  seen  in  Figure  10).  On  the  other 
hand,  if  the  interarrival  interval  r  is  the  same  for  all 
streams  then  shorter  batches  from  less  busy  streams 
may  have  to  be  dropped  more  frequently. 
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Figure  3:  Batch  loss  ctirves  for  N=4  streams,  period  T=40  and  deadlines  D=40(solid),  CO(dotted), 
80(dashed  line)  for  HOMOGENEOUS  periodic  traffic  with  GEOMETRIC  distribution.  The  losses 
decrease  as  a  function  of  increasing  deadlines  as  expected. 


Figure  4:  Batch  loss  curves  for  N=4  streams,  period  T=40  and  deadlines  D=40  and  CO  (GEOMET¬ 
RIC  distribution).  Top  line  is  FIFO  without  expired  cell  discarding,  line  with  the  stars  is  FIFO 
with  expired  cell  discarding,  line  with  circles  is  Policy  2  and  bottom  solid  line  is  Policy  1. 
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Figure  5:  Batch  loss  curves  for  N=8  streams,  period  T=64  and  deadlines  D=64  and  96  (GEOMET¬ 
RIC  distribution).  Top  solid  line  is  FIFO  without  expired  cell  discarding,  dashed  line  is  FIFO  with 
expired  cell  discarding,  dotted  line  is  Policy  2  and  bottom  solid  line  is  Policy  1. 


Figure  6:  Cell  loss  curves  for  N=8  streams,  period  T=64  and  deadlines  D=64  and  96  (GEOMETRIC 
distribution).  Top  line  is  FIFO  without  expired  cell  discarding,  line  with  stars  is  FIFO  with  expired 
cell  discarding,  line  with  circles  is  Policy  2  and  bottom  solid  line  is  Policy  1. 
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Figure  7:  Batch  loss  curves  for  N=8  streams,  period  T=64  and  deadlines  D=64  and  9G  for  BURSTY 
1-L  periodic  traffic.  Top  line  is  FIFO  without  expired  cell  discarding,  line  with  stars  is  FIFO  with 
expired  cell  discarding,  line  with  circles  is  Policy  2  and  solid  line  is  Policy  1. 
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Figure  8:  Cell  loss  curves  for  N=8  streams,  period  T=64  and  deadlines  D=64  and  96  for  BURSTY  1-L  periodic 
traffic.  Top  line  is  FIFO  without  expired  cell  discarding,  line  with  stars  is  FIFO  with  expired  cell  discarding,  line 
with  circles  is  Policy  2  and  bottom  solid  line  is  Policy  1. 


Figure  9:  Batch  loss  curves  for  N=8  streams,  period  T=64  and  deadline  D=G4  for  INHOMOGE¬ 
NEOUS  BURSTY  1-L  periodic  traffic.  Top  line  is  FIFO  without  expired  cell  discarding,  line  with 
stars  is  FIFO  with  expired  cell  discarding,  line  with  circles  is  Policy  2  and  bottom  solid  line  is 
Policy  1. 


Figure  10:  Batch  loss  curves  for  N  =  8  streams,  period  T=64  and  deadline  D  =  G4  for  INHOMOGE¬ 
NEOUS  BURSTY  1-L  periodic  traffic.  Solid  line  is  Policy  1  with  equal  values  of  r  and  line  with 
circles  is  Policy  1  with  weighted  values  of  r  (tau). 
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Abstract  -  In  this  work,  call  admission  and  transmis¬ 
sion  scheduling  policies  are  studied  for  a  TDMA 
system  servicing  Variable  Bit  Rate  (VBR)  appli¬ 
cations  with  distinct  QoS  requirements  and  traf¬ 
fic  characteristics.  In  this  environment,  packets 
which  experience  excess  delays  are  dropped  (due 
to  delay  violations)  at  the  source.  In  addition, 
packets  are  dropped  at  the  receiver  due  to  channel 
induced  errors  (interference)  in  the  wireless  link. 
The  focus  of  the  research  is  to  determine  the  re¬ 
gion  of  achievable  QoS  vectors  for  heterogeneous 
VBR  applications  in  this  shared  resource  environ¬ 
ment  and  study  the  impact  that  channel  quality 
has  on  the  achievable  performance.  The  region  of 
achievable  QoS  vectors  is  central  to  the  call  admis¬ 
sion  problem  and  in  this  work,  it  is  used  to  identify 
a  class  of  scheduling  policies  capable  of  delivering 
any  achievable  performance. 


I.  INTRODUCTION 

In  Integrated  Services  Wireless  Networks  resources 
are  shared  among  geographically  disperse  applications 
with  diverse  traffic  characteristics  and  QoS  require¬ 
ments.  The  shared  transmission  resources  are  defined 
to  be  the  slots  (packet  transmission  times)  of  a  TDM  A 
frame.  This  resource  structure  has  been  widely  con¬ 
sidered  in  both  cellular  systems  [1]  and  wireless  LANs 
[2],  as  well  as  in  recent  work  toward  the  development 
of  wireless  ATM  networks  [3,  4]. 

The  focus  of  this  work  is  to  determine  the  region 
of  achievable  QoS  vectors  for  heterogeneous  VBR  ap¬ 
plications  in  a  shared  wireless  environment  .  The  QoS 
requirements  for  each  application  is  defined  in  terms  of 
a  maximum  tolerable  packet  delay  and  dropping  prob¬ 
ability.  Determining  the  region  of  achievable  QoS  vec- 


Research  supported  in  part  by  the  Advanced  Research 
Project  Agency  under  Grant  F49620-93- 1-0564  monitored  by 
the  Air  Force  Office  of  Scientific  Research  (AFOSR)  and  the 
National  Science  Foundation  under  Grant  NCR  9628116. 


tors  is  central  to  the  development  of  a  Call  Admission 
Control  mechanism.  For  example,  if  with  the  addition 
of  the  new  source,  the  new  multi-dimensional  target 
QoS  vector  is  in  the  region  of  achievable  QoS  vectors 
then  the  call  can  be  admitted.  If  the  call  can  not  be 
admitted,  but  more  resources  can  be  made  available, 
a  precisely  defined  region  of  achievable  QoS  can  be 
used  to  determine  the  minimum  additional  resources 
required  in  order  for  the  new  call  to  be  admitted,  in 
this  work,  the  region  of  achievable  QoS  vectors  is  also 
used  to  identify  a  class  of  scheduling  policies  capable 
of  delivering  any  achievable  performance. 

In  an  error-free  channel  all  transmitted  packets  are 
successfully  received.  In  this  case,  the  delivered  QoS  is 
shaped  by  the  packet  discarding  process  at  the  trans¬ 
mitter  (source)  due  to  delay  violations;  the  latter  oc¬ 
cur  when  the  demand  exceeds  the  amount  of  avail¬ 
able  resources  for  a  sufficiently  long  period.  Thus,  the 
performance  is  limited  by  the  amount  of  available  re¬ 
sources  (resource  limited).  The  region  of  achievable 
QoS  vectors  in  an  error-free  channel  environment  has 
been  investigated  in  [5]. 

While  an  optical-fiber  based  channel  may  be  prac¬ 
tically  considered  to  be  error-free,  this  is  not  the  case 
with  wireless  channels.  Although  the  necessary  re¬ 
sources  may  become  available  on  time,  packets  may 
be  corrupted  due  to  channel  induced  errors  and  be 
dropped  at  the  receiver.  Under  these  conditions,  the 
performance  is  limited  by  the  interference  introduced 
in  the  wireless  channel  (interference  limited).  Such 
packet  discarding  may  occur  with  a  frequency  compa¬ 
rable  to  that  of  the  packet  discards  at  the  transmitter 
due  to  resource  limitations.  As  a  consequence,  the  re¬ 
gion  of  achievable  QoS  vectors  is  shaped  by  the  packet 
discarding  process  at  both  the  transmitter  and  the  re¬ 
ceiver  due  to  resource  and  interference  limitations,  re¬ 
spectively. 

II.  DESCRIPTION  OF  THE  SYSTEM  MODEL 

Consider  a  system  where  N  VBR  (Variable  Bit  Rate) 
sources  compete  for  T  slots  in  the  up-link  channel.  T 


Let  0  be  an  indicator  function  of  an  outage  (error 
event).  That  is, 


represents  the  number  of  slots  in  the  TDMA  frame 
available  for  VBR  traffic.  At  the  beginning  of  each 
frame  n  each  source  i  will  request  a  random  number 
of  slots  denoted  by  A »(n).  If  the  aggregate  demand 
in  frame  n,  YliLi  A»(n),  exceeds  the  number  of  slots 
available  to  the  VBR  traffic  (T)  -  referred  to  as  an 
overloaded  frame  *  then  decisions  must  be  made  re¬ 
garding  the  amount  of  service  that  will  be  provided 
to  each  source.  The  portion  of  slots  under  policy  / 
allocated  to  source  i,  a/(n),  may  be  less  than  what 
is  required  by  that  source,  A,(n),  due  to  resource  lim¬ 
itation.  Packets  from  a  source  which  do  not  receive 
service  over  a  frame  are  considered  to  have  excess  de¬ 
lay  and  are  dropped  at  the  source. 

The  above  environment  could  model  the  sharing 
of  the  up-link  TDMA  frame  of  a  wireless  network 
where  request  are  processed  at  frame  boundaries.  Fig. 
1  illustrates  a  typical  up-link  TDMA  frame  whose 
slots  are  allocated  to  various  classes  of  services,  such 
as  Continuous  Bit  Rate  (CBR),  Available  Bit  Rate 
(ABR)  and  Variable  Bit  Rate  (VBR).  In  this  figure 
T  =  3  slots  are  assumed  to  be  available  to  the  VBR 
applications. 

Request  Channel 


C  Assigned  CBR  slot 
V  Slot  available  for  VBR  traffic 
A  Slot  available  for  ABR  traffic 


f  1  if  5/ft  <  6 
{  0  if  SIR  >  b  ' 


(2) 


In  this  model,  it  is  assumed  that  outages  are  indepen¬ 
dent  and  identically  distributed;  with  minor  modifica¬ 
tions,  correlation  among  outages  can  be  easily  consid¬ 
ered. 

Due  to  the  above  competition  for  the  resources  and 
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Suppose  that  the  QoS  requirement  of  application  i 
is  defined  in  terms  of  a  maximum  tolerable  average  per 
frame  packet  dropping  rate  r/,,  0  <  i  <  N .  Then  the 
QoS  vector  associated  with  the  supported  applications 
can  be  defined  in  terms  of  the  (performance)  packet 
dropping  rate  vector  d, 

d  —  (r/i,  r/'j,  . . . ,  r/,v).  (A) 


Figure  1.  TDMA  frame  supporting  CBR,  VBR  and  ABR 
traffic  classes. 

In  this  paper  a  transmitted  packet  will  also  be  dis¬ 
carded  at  the  receiver  if  the  signal  to  interference  ratio 
(SIR)  falls  below  a  threshold  6.  In  a  wireless  chan¬ 
nel,  the  received  signal  power  may  be  described  by 
taking  into  consideration  three  effects  [6]:  attenua¬ 
tion  due  to  distance,  shadowing  -  typically  described 
by  a  log-normal  distribution,  and  fading  that  causes 
the  instantaneous  envelope  of  the  received  signal  to 
be  Rayleigh  distributed  and  consequently  its  power  is 
exponentially  distributed.  An  outage  is  defined  as  an 
event  that  occurs  when  the  SIR  falls  below  a  prede¬ 
termined  threshold,  6,  sometimes  called  capture  ratio. 
It  is  assumed  that  when  a  source  experiences  an  out¬ 
age  its  packet  is  corrupted,  otherwise,  it  is  correctly 
received.  Therefore,  the  probability  that  a  packet  is 
discarded  at  the  receiver  is  defined  as, 

(1) 


When  the  QoS  requirement  of  the  application  i  is  de¬ 
fined  in  terms  of  a  maximum  tolerable  packet  dropping 
probability  pt,  the  corresponding  packet  dropping  rate 
d{  is  easily  determined  by  di  =  A,  p,-. 

The  first  question  addressed  in  the  sequel  (section 
III)  is  whether  (under  the  given  channel  conditions) 
a  given  QoS  vectors  d  is  achievable  under  any  policy 
/.  The  second  question,  addressed  in  section  IV,  is 
concerned  with  the  design  of  scheduling  policies  that 
deliver  an  achievable  target  QoS  vector  d. 

III.  DETERMINATION  OF  THE  REGION  OF 
ACHIEVABLE  QOS  VECTORS 

The  establishment  of  the  region  of  achievable  QoS  vec¬ 
tors  is  based  on  a  set  of  inequalities  and  an  equality 
constraint  derived  by  employing  work-conserving  ar¬ 
guments.  A  work-conserving  policy  (denoted  as  /) 
does  not  waste  resources  (slots)  as  long  as  there  is 
work  to  perform  (packets  to  transmit).  Details  re¬ 
garding  these  derivations  may  be  found  in  [7]. 


P[SI R  <  6]. 


Let  5  =  {1,2, . . JV}  be  the  set  of  all  sources  and 
c/|5j  denote  the  average  system  packet  dropping  rate 
under  scheduling  policy  /,  denoted  by, 

d\S]  =  E  =  £  E  K (n)]  =  2  d'  ■  (5) 

Lt=l  J  *=1  i— 1 

Define  A5(n)  =  A,(n)  for  any  g  C  5. 

Under  any  work-conserving,  /,  the  average  system 
packet  dropping  rate  is, 

d{s}  =  £[/?]{£  [A5(n)  |  As(n)<T]}  (6) 

■Pr(Xs(n)<T) 

+  {E  [As(n)  |  A s(n)  >  t]  -T(l  -  £[/?])} 

■Pr  (As(n)  >  T) . 

As  it  can  be  seen  from  (6),  c/|5j  is  independent  of  the 

policy  /.  Therefore,  the  system  dropping  rate, 
is  conserved  under  any  policies  /  and  is  denoted  as 
&{S). 

Let  c/j^j  denote  the  average  subsystem  {#}  packet 
dropping  rate  under  policy  /,  defined  by, 


=  XX’  9CS- 

»€</ 


That  is,  is  equal  to  the  aggregate  packet  dropping 
rate  associated  with  sources  in  group  g  only,  under 
policy  /;  all  N  sources  in  5  are  assumed  to  be  present 
and  served  under  policy  /. 

Let  b{g }  denote  the  lower  bound  for  the  aggregate 
packet  dropping  rate  for  sources  in  {<7},  determined 
to  be  equal  to  the  packet  dropping  rate  of  a  system  in 
which  only  sources  in  {<7}  are  present  and  served  under 
a  work-conserving  policy.  Sources  in  set  {5  —  <7}  are 
considered  to  be  removed.  It  is  given  by, 

6w  =  E[/?]{E[A,(n)|AJ(n)<r]}  (8) 

■Pr(Xg(n)<T) 

+  {£;[A,(n)  |  A»>r]-T(l  — £[/?])} 

■Pr(A,(n)>T). 

It  is  apparent  that  no  policy  can  deliver  a  lower  drop¬ 
ping  rate  to  sources  in  set  g  than  ,  when  all  sources 
in  5  are  present.  Thus,  6^}  is  a  lower  bound  on  the 
performance  induced  for  the  set  g  under  any  policy. 
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Dropping  rate,  source  1 

Figure  2.  The  region  (polytope)  V  for  a  system  with  two 
sources. 

It  is  shown  in  [7]  that  necessary  and  sufficient  con¬ 
ditions  for  a  QoS  vector  d  =  (d\ ,  . . . ,  dt\)  to  be 

achieved  by  a  scheduling  policy  /  are  that  its  compo¬ 
nents  satisfy  the  following  constraints, 

d{3)  >  b{3)  VyQS  (9) 

d{s)  =  b{S).  (10) 

Let  V  denote  the  collection  of  all  vectors  d  satisfying 
(9)  and  (10).  Any  vector  in  the  set  V  can  be  expressed 
as  a  convex  combination  of  extreme  points  (vertices) 
of  V\  that  is,  V  may  be  expressed  as  the  convex  hull 
of  its  extreme  points,  P=conv[exp(D)]l  see  [7].  It  is 
shown  in  [7]  that  d“  is  a  vertex  of  the  set  V  iff  d*  is  a 
dropping  rate  vector  resulting  from  an  Ordered  HoL 

(O-HoL)  priority  service  policy,  7r=  (tti  ,  7r2 _ ,?r;v); 

TTj  £  {1, 2, . . . ,  N}y  Xi  ttj  ,  1  <  i,  j  <  N .  The  index 
of  7 r,-  indicates  the  order  of  the  priority  given  to  the  7T ,• 
source.  None  of  the  tt ;  sources,  j  >  i,  may  be  served 
as  long  as  packets  from  sources  *>,  k  <  i,  are  present. 

Fig.  2  and  Fig.  3  provide  a  graphical  illustration 
of  the  region  V  for  the  case  of  N  =  2  and  N  =  3 
sources,  respectively.  The  extreme  points  correspond 
to  QoS  vectors  d  induced  by  the  N\  ordered  HoL  pri¬ 
ority  policies  7r=  (7ri,7r2,  ...,ttjv).  Referring  to  Fig. 

2,  it  may  be  observed  that  the  policy  (7Ti,7r2)  =  (1,2) 
corresponds  to  the  intersection  of  the  line  for  the  lower 
bound  on  the  packet  dropping  rate  line  for  source  1, 
6{i},  with  the  system  dropping  rate  line  6{i,2)>  Simi¬ 
larly,  the  second  extreme  point  induced  by  the  policy 
( 7T i ,  7T2 )  =  (2,  1)  is  the  intersection  of  the. lower  bound 
on  the  packet  dropping  rate  for  source  2,  6{2j  and 
6{ii2).  Similar  observations  can  be  made  for  the  re¬ 
gion  V  for  a  system  of  N  =  3  sources  shown  in  Fig. 

3. 


that, 
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Figure  3.  The  region  (polytope)  V  for  a  system  with  three 

sources. 

IV.  A  CLASS  OF  POLICIES  INDUCING  ALL  QOS 
VECTOR  IN  V 

Let  Co-HoL  denote  the  class  of  Ordered  HoL  (O-HoL) 
priority  service  policies  7T  introduced  in  section  III.  A 
mixing  O-HoL  priority  service  policy  frn  is  defined  to 
be  one  that  at  each  frame  decides  to  follow  the  O-HoL 
priority  policy  7r'  =  (  with  probability 

a,  >0,  1  <  2  <  AM,  =  i  Q»  —  li  decisions  over 
consecutive  frames  are  independent.  Clearly,  }rn  is 
completely  determined  by  the  /V!  dimensional  vector 
a,  a  >  0.  1-  a  =  1 .  Let  Mo-HoL  denote  the  class  of 
such  policies. 

For  each  packet  dropping  rate  vector  d  E  V  there 
exists  a  policy  /m  E  Mo-HoL  that  induces  d.  The 
proof  follows  from  the  fact  that.  V  can  be  written 
as  a  convex  combination  of  the  extreme  points  (ver¬ 
tices)  dcr,_t-  of  P,  that  is  d  =  Yli2\  Qidext-i  for 
some  q  —  (ai ,  c*2, . . . ,  a/vj)  where  a,  >0,  1  <  i  < 
N\y  Qi  ==  1*  Since  each  dexf_,  is  induced  by 

some  policy  in  Co-HoL ,  a  mixing  policy  fm  that 
selects  the  HoL  priority  7r*  (that  induces  dexr_,) 
with  probability  a,-  such  that  or,*  >  0,  1  <  i  < 
Ar ! ,  j  a,-  =  1  will  have  a  packet  dropping  rate 

vector  d^m  given  by, 

/V! 

d/m  =  (11) 

«=1 

and  thus  fm  induces  d. 

Let  d  E  P  be  a  target  packet  dropping  rate  vector. 
The  mixing  policy  fm  =  a  induces  d,  where  a  is  such 


£a  [dert]  =  ^a;dert-.  =  <1,  (12) 

i=l 

a  >  0,  (13) 

1  a  =  1,  (14) 

where  Eql[\  is  weighted  average  of  the  set  of  extreme 

points,  dexf,  ofP  with  respect  to  the  Probability  Mass 
Function  a. 

V.  NUMERICAL  AND  SIMULATION  RESULTS 

In  this  section,  numerical  examples  and  simulations 
are  carried  out  for  a  system  with  two  sources  compet¬ 
ing  for  5  slots  in  a  TDMA  frame.  The  source  packet 
arrival  processes  are  assumed  to  be  mutually  indepen¬ 
dent.  Each  arrival  process  is  described  in  terms  of  an 
underlying  finite-state  Markov  chain  embedded  at  the 
frame  boundaries.  The  number  of  packets  generated 
(and  requesting  service)  by  a  source  in  the  current 
frame  boundary  is  (probabilistically)  determined  by 
the  present  state  of  the  associated  underlying  Markov 
chain. 

Consider  in  this  example  the  typical  TDMA  frame 
in  which  one  slot  per  frame  represents  a  32  Kbps  chan¬ 
nel.  Let  Ek  =  {0,  1 , . . . ,  AIk  —  l }  denote  the  state 
space  of  the  Markov  chain  associated  with  source  A*, 
A*  =  1,2.  In  this  example  A/1  =  6  and  M 2  =  G. 
When  in  state  i ,  a  source  generates  i  packets  in  the 
current  frame.  Source  1  and  2  could  be  used  to  model 
a  low  rate  video  transmissions  with  maximum  rate  G 
slots/frame  =  192  Kb[  )s.  Further  description  of  all 
two  sources  can  found  in  Table  I.  The  arrival  rates 
are  easily  computed  and  are  equal  to  X\  =4.2  and 
An  =  3.2  packets  per  frame. 


State 

State  Probability 

Source  1 

Source  2 

0 

0 

0.1 

1 

0.1 

0 

2 

0.1 

0.4 

3 

0.3 

0 

4 

0.2 

0.3 

5 

0.2 

0 

G 

0.1 

0.2 

Table  I 

Description  of  Sources. 

With  the  sources  given  above,  (9)  and  (10)  were 
evaluated  with  T  =  5  slots  and  under  various  channel 
conditions.  The  impact  channel  quality  has  on  the  re¬ 
gion  of  achievable  QoS  vectors  can  be  seen  in  Table  II. 


Numerical  and  Simulation  Results 


The  channel  quality  has  a  greater  relative  effect  on  the 
lower  bounds  for  the  subsets  than  on  the  system.  This 
implies  that  when  the  system  is  very  diversified,  and 
some  sources  are  requesting  service  close  to  their  lower 
bounds,  then  the  channel  quality  is  an  important  fac¬ 
tor  in  determining  whether  the  target  performance  is 
achievable.  Simulation  results  are  presented  in  Table 
HI.  The  previously  described  sources  in  Table  I  were 
simulated  and  service  (with  T  =  5  slots  per  frame)  ac¬ 
cording  to  various  mixing  policies.  Each  mixing  policy 
was  given  by  a,  which  is  the  probability  of  selection 
of  O-HoL  policy  7t=(1,2)  (and  therefore  7r=(2,l)  with 
probability  (1  —  a)),  and  varied  from  0  to  1. 


Theoretical  Results 

m 

6m 

6{2} 

0 

0.1000 

0.2000 

2.0700 

0.01 

0.1350 

0.2300 

2.1173 

0.02 

0.1700 

0.2600 

2.1646 

Table  II 

Impact  of  channel  quality  on  the  region  of  achievable  QoS. 


Simulation  Results 

II3E9 

G 

di 

d2 

Eszai 

0.00 

0.00 

1.8835 

0.1978 

2.0814 

0.25 

2.0814 

2.0814 

SalliS 

1.5455 

2.0814 

1.9844 

\mm 

0.2279 

2.1286 

’iHHi 

1.4540 

0.6746 

2.1286 

■ 

1.0101 

1.1184 

2.1286 

■  H 

0.75 

0.5668 

1.5618 

2.1286 

1.00 

0.1324 

1.9962 

2.1286 

0.02 

0.00 

1.9178 

0.2586 

2.1764 

0.7008 

2.1764 

0.50 

1.0362 

1.1402 

2.1764 

0.75 

0.5979 

1.5784 

2.1764 

2.0086 

2.1764 

Table  III 

Simulation  results  displaying  the  effects  of  the  channel 
quality  on  the  performance  of  various  mixing  policies. 

Let  dk  denote  the  target  packet  dropping  probabil¬ 
ity,  for  source  k,  k  =  1,2.  In  this  example  it  is  assumed 
that  d\  =  1.220  and  do  =  0.900.  Thus,  the  QoS  vec¬ 
tor  cl  is  given  by  d  =  (di,d2)  =  (1.220,0.900)  and 
the  associated  system  packet  dropping  rate  is  equal  to 
d{S)  =  d\  +d2  =  2.120.  By  satisfying  (9)  and  (10)  (see 
Table  II),  it  can  be  established  that  the  performance 
can  be  achieved  when  E[/3 ]  <  0.01.  When  the  channel 


Figure  4.  Numerical  evaluation  of  the  region  of  achievable 
QoS  under  various  channel  conditions  for  the  system  of 
sources  given  in  Table  1  {T  =  5)  and  simulation  results 
from  Table  3. 

condition  are  given  by  £[J]  =  0.02,  the  system  drop¬ 
ping  rate  can  not  be  satisfied,  and  there  the  QoS  can 
not  be  delivered  to  applications. 

When  E[3]  <  0.01,  d^s)  >  6{5}  and  thus,  ac¬ 
cording  to  [7],  there  exists  a  vector  d'  such  that 
g  C  S  and  =  b{S}\  d'  is  said  to 

have  improved  performance.  Consider  the  case  when 
£[/?]  =  0.01,  and  let  d'  =  (1.218,0.899);  d'  €  V  and 
thus  there  exists  a  mixing  ordered  IloL  priority  pol¬ 
icy  fm  =  ol  achieving  exactly  the  QoS  vector  d'.  Any 
a  satisfying  the  conditions  (12),  (13)  and  (14)  may 
be  chosen.  The  following  a0  was  found  by  employing 
linear  programming  techniques, 


q  i  =  0.3S1S 
g  2  =  0.6181 


(15) 


In  this  example  there  exists  only  one  solution,  but  in 
systems  with  greater  than  2  sources  more  than  one  so¬ 
lution  may  be  found.  This  allows  for  the  incorporation 
of  additional  constraints  representing  other  desirable 
qualities  of  the  policies.  Functions  of  interest  may  be 
minimized  subject  to  the  constraints  presented  in  this 
paper  to  guarantee  the  delivery  of  the  target  QoS  vec¬ 
tor.  For  instance,  among  all  mixing  policies  inducing 
d',  the  one  which  minimizes  the  variance  of  the  service 
provided  to  certain  sources  may  be  identified. 

A  simulation  was  performed  using  the  policy,  a0, 
derived  for  this  example.  The  sources,  as  described  in 
Table  I,  were  simulated  and  generated  the  service  de¬ 
mand.  Resources  (slots)  were  allocated  according  to 
the  prescribed  policy  and  the  number  of  dropped  pack¬ 
ets  per  frame  from  each  source  was  recorded.  The  sim¬ 
ulation  time  was  50000  frames  or  2000  seconds  (i.e.,  40 


Source,  i 

Theoretical,  di 

Simulation,  d{(k) 

1 

1.218 

1.211 

2 

0.899 

0.893 

Table  IV 

Comparison  of  time-averaged  and  theoretical  steady-state 
mean  packet  dropping  rate  for  each  source  under  policy 

Qo. 

ms  frame).  Throughout  the  simulation  in  each  frame 
rn .  for  each  source  i,  the  time-averaged  mean  dropping 
rate  was  calculated  as, 

,  * 

=  (16) 
m  =  1 

The  resulting  time-averaged  mean  packet  dropping 
rate  for  each  source  under  the  policy  was  found  and  is 
displayed  in  table  IV.  Throughout  the  progression  of 
the  simulation  the  tirne-averaged  mean  packet  drop¬ 
ping  rates  were  recorded  and  are  displayed  in  Fig. 
5. Under  the  policy  the  objective  of  mean  packet  drop¬ 
ping  rate  c/,  for  each  source  i  is  met,  see  table  IV  and 
Fig.  5. 


Simulation  Results  for  Mixing  Policy  a,  and  E(pJ=0.01 


Figure  5.  Simulation  results  obtained  with  the  derived 
policy  ot O' 

As  it  was  stated  in  section  3,  satisfying  the  condi¬ 
tion  on  the  system  performance,  given  by  (10),  is  only 
necessary  and  not  sufficient  to  guarantee  that  the  tar¬ 
get  QoS  vector  is  achievable1.  To  illustrate  this  con¬ 
cept,  consider  the  system  given  in  the  previous  exam¬ 
ple  with  E[j3]  =  0  and  let  d  =  (0.099,  1.971)  be  the 

1  In  the  special  case  of  a  homogeneous  system,  such  as  a 
cellular  voice  system,  satisfying  (10)  is  sufficient  to  guarantee 
that  the  target  QoS  vector  is  achievable.  This  result  has  been 
established  in  [5]. 


target.  QoS  vector,  d  satisfies  the  condition  on  the 
system  dropping  rate,  that  is  j  =  6(5}  =  2.07,  but 
d\  <  b\  =  0.100.  Thus,  the  target  QoS  vector  cannot 
be  achieved  under  any  policy.  For  this  system,  the 
best  possible  performance  source  1  can  achieve  is  if  it 
is  given  service  priority,  resulting  in  dropping  rates  of 
0.100  and  1.970  for  sources  1  and  2,  respectively.  The 
overall  system  performance  is  satisfied,  but  source  1  is 
experiencing  poorer  service  than  what  is  desired,  while 
sources  2  is  experiencing  improved  performance. 

VI.  CONCLUSION 

In  this  work,  the  region  of  achievable  QoS  vectors  was 
established  for  a  system  of  heterogeneous  VBR  sources 
competing  for  slots  of  a  TDMA  frame.  The  QoS  for 
each  application  was  defined  to  be  the  packet  dropping 
rate,  which  results  from  the  competition  for  resources 
(slots  may  not  come  available  on  time  and  a  source  is 
forced  to  drop  the  packets  with  excess  delay)  as  well  as 
channel  induced  errors  (corrupted  packets  are  dropped 
at  the  receiver).  Knowledge  of  this  region  was  used  to 
develop  a  class  of  scheduling  policies  that  can  deliver 
any  achievable  performance.  Together,  the  call  admis¬ 
sion  and  scheduling  functions  can  ensure  the  delivery 
of  the  target  QoS  vector  under  the  given  channel  con¬ 
ditions. 
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Abstract 

Typical  rate-based  traffic  management  schemes  for  real¬ 
time  applications  attempt  to  allocate  resources  by  control¬ 
ling  the  packet  delivery  to  the  resource  arbitrator  (sched¬ 
uler).  This  control  is  typically  based  only  on  the  charac¬ 
teristics  of  the  particular  (tagged)  traffic  stream  and  would 
fail  to  optimally  adjust  to  non-nominal  network  conditions 
such  as  overload.  In  this  paperf  an  integrated  Regulation 
and  Scheduling  (dynamic- R&S)  scheme  is  proposed  whose 
regulation  function  is  modulated  by  both  the  tagged  stream ’s 
characteristics  and  some  information  capturing  the  state 
of  the  co-existing  applications  as  provided  by  the  scheduler. 
The  performance  of  the  proposed  scheme  -  as  well  as  that  of 
the  equivalent  static  one  -  is  investigated  under  both  under¬ 
load  and  overload  traffic  conditions  -  both  analytically  and 
through  simulation  -  and  the  substantially  better  through¬ 
put  /jitter  characteristics  of  the  dynamic-R&S  scheme  are 
established. 


1  Introduction 

A  secure  solution  to  the  problem  of  guaranteeing  the 
QoS  of  real-time  applications  will  typically  require  the 
reservation  of  the  maximum  amount  of  needed  resources. 
Because  of  the  anticipated  low  -  due  to  resource  require¬ 
ment  fluctuation  -  network  utilization,  alternative  solu¬ 
tions  are  being  considered  based  on  “over  allocation”  of 
resources  to  a  group  of  applications  (multiplexing).  Over 
allocation  aims  at  improving  the  resource  utilization,  at  the 
expense  of  failing  to  accommodate  at  certain  times  peak 
resource  demands  from  a  number  of  the  grouped  applica¬ 
tions.  Grouping  of  applications  and  “over-allocation”  of 
resources  are  the  key  aspects  of  non-degenerate  statistical 
multiplexing. 

Because  of  the  stringent  QoS  requirements  of  real-time 
applications,  it  is  expected  that  “traditional”  statistical 
multiplexing  schemes, such  as  FCFS,  will  not  be  effective 

•This  research  is  supported  in  part  by  the  Advanced  Research 
Project  Agency  under  Grant  F49620-93- 1-0564  monitored  by  the  Air 
Force  Office  of  Scientific  Research  (AFOSR)  and  the  National  Science 
Foundation  under  Grant  NCR-9628116. 


for  such  applications.  It  is  well  understood  that  some 
tighter  control  should  be  exercised  on  input  output  as  well 
as  in  the  internal  processes  of  traffic  handling.  In  principle, 
a  target  value  of  a  QoS  metric  may  be  possible  to  achieve 
through  either  tight  traffic  regulation  (typically  referred 
to  as  rate-based  approach),  or  sophisticated  scheduling 
(typically  referred  to  as  scheduler-based  approach)  only. 
In  most  practical  cases  though,  some  scheduling  will  be 
needed  to  resolve  transmission  conflicts  among  rate-based 
controlled  applications.  Similarly,  some  traffic  filtering 
(regulation)  will  be  needed  to  eliminate  extreme  traffic  re¬ 
alization  which  would  be  hard  to  manage  even  by  a  so¬ 
phisticated  scheduler  under  a  scheduler-based  approach.  In 
addition  to  the  supporting  role  of  regulation  in  a  scheduler- 
based  approach,  and  scheduling  in  a  rate-based  approach, 
higher  multiplexing  gain  may  be  achieved  by  allowing  for 
some  cooperation  between  the  two  functionalities  which 
will  typically  co-exist  and  be  designed  to  achieve  a  com¬ 
mon  goal.  Such  a  dynamic  policy  is  investigated  in  the 
present  paper. 

Substantial  effort  has  been  directed  toward  the  devel¬ 
opment  of  regulation  and  scheduling  schemes  for  real¬ 
time  applications.  Examples  of  regulation  and  schedul¬ 
ing  schemes  for  real-time  applications  include  :  D-EDD 
(Delay-Earliest  Due  Date)  [I];  J-EDD  (Jitter  Earliest  Due 
Date)  [2];  HRR  (Hierarchical  Round  Robin)  [3];  S&G  (Stop 
and  Go  Queuing)  [4];  WFQ  (Weighted  Fair  Queuing)  [5]; 
PGPS  (Packet  Generalized  Processor  Sharing)  [6];  RCSP 
(Rate  Controlled  Static  Priority)  [7];  LIT  (Leave  In  Time) 
[8];  MRTS  (Multi  Rate  Traffic  Shaping)  [9];  VC  (Virtual 
Clock)  [10]. 

The  traffic  management  scheme  for  real-time  applica¬ 
tions  investigated  in  this  work  may  be  viewed  as  an  en¬ 
hancement  of  the  Rate  Controlled  Static  Priority  (RCSP) 
scheme  proposed  in  [7]. 

2  The  Proposed  Dynamic  R&S  Policy 

The  typical  primary  objective  in  regulating  real-time 
traffic  stream  within  the  network  is  to  control  jitter  or 
the  instantaneous  rate  (throughput).  This  is  achieved  in 
the  RCSP  mechanism  [7]  by  enforcing  a  minimum  spacing 


at  the  output  of  the  regulator  associated  with  the  traffic 
stream  of  interest  (tagged  traffic  stream).  Fig.  1  (without 
the  feedback  arrow)  shows  a  block  diagram  of  an  architec¬ 
ture  implementing  a  RCSP  mechanism  where  each  of  the 
Ar  multiplexed  streams  is  regulated  before  it  is  considered 
for  transmission. 


Regulator  Block 


Figure  1:  The  Regulation  and  Scheduling  architecture. 


Since  scheduling  conflicts  will  arise  when  more  than  one 
regulated  applications  are  present,  a  scheduler  needs  to  be 
employed  to  resolve  these  conflicts.  A  consequence  of  the 
scheduling  conflicts  is  that  the  tagged  traffic  stream  at  the 
output  of  the  scheduler  will  be  a  distorted  version  of  the 
target  stream  enforced  at  the  output  of  the  correspond¬ 
ing  regulator.  For  instance,  although  a  minimum  spacing 
between  consecutive  tagged  cells  is  enforced  at  the  out¬ 
put  of  the  tagged  regulator  in  Fig.  1,  this  docs  not  hold 
true  for  the  tagged  stream  at  the  output  of  the  scheduler. 
This  clustering  is  generated  due  to  an  increased  arrival  rate 
to  the  scheduler  in  the  immediate  past  which  has  pushed 
back  (delayed)  earlier  tagged  cells.  Due  to  the  latter,  some 
spreading  followed  by  some  clustering  of  tagged  cells  is  ex¬ 
pected  to  be  observed  at  the  output  of  the  scheduler. 

The  tagged  cell  spreading  mentioned  above  can  be  re¬ 
duced  by  monitoring  the  scheduler  and  releasing  a  tagged 
cell  before  its  eligibility  time  1  when  scheduler  queue  build 
ups,  which  will  cause  the  spreading,  are  detected.  The  dy¬ 
namic  Regulation  and  Scheduling  (dynamic-R&S)  scheme 
proposed  below  attempts  to  provide  for  a  smoother  tagged 
traffic  at  the  output  of  the  scheduler  based  on  this  idea. 

Under  the  dynamic-R&S  policy  proposed  below,  the  reg¬ 
ulation  process  is  modulated  by  some  scheduler  status  in¬ 
formation.  Unlike  in  past  work  in  the  area,  appropriate 
information  regarding  the  status  of  the  scheduler  (FCFS 
queue)  is  fed  back  to  the  regulators,  as  indicated  in  Fig.  1 
with  the  feedback  arrow.  The  proposed  cell  release  scheme 
is  explained  next. 

Let  tk  denote  the  time  slot  at  which  the  ktfl  tagged 
cell  is  released  from  the  tagged  regulator.  Let  Qrk  denote 

*Here  defined  as  T  time  units  following  the  previous  tagged  cull 
release,  if  a  minimum  spacing  of  T  is  targeted. 


the  queue  occupancy  at  the  regulator  upon  (following)  the 
release  of  the  kth  cell.  Let  tk  +  Bk  (Bk  >  1)  denote  the  time 
slot  at  which  the  cumulative  number  of  non-tagged  arrivals 
(releases)  to  the  scheduler  following  tk  exceeds  T  -  2  for 
the  first  time.  Let  a  superscript  d  (s)  indicate  a  quantity 
associated  with  the  dynamic-R&S  (static-R&S)  policy  and 
let 

Wk  =  min{Bk,T}  or  W  =  min{B,T),  (l) 

where  the  last  expression  involves  the  generic  random  vari¬ 
ables  \Y  and  B .  The  (k+  l)st  tagged  cell  release  time  £*+i 
is  given  by 

tk+i  =  tk  +  Hfc  +  ll'l  *  1{o;=o,4i,",*=o}’  (2) 

where  llk  denotes  the  time  interval  between  tk  =  tk  4-  14* 
and  the  first  tagged  cell  arrival  following  tk;  T  is  a  constant 
positive  integer;  J  is  the  number  of  cell  arrivals  to  the 
dynamic  regulator  over  j  slots. 

If  T  is  equal  to  the  minimum  spacing  among  consecutive 
tagged  cell  releases  from  the  regulator  in  the  RCSP  scheme 
[7],  then  it  is  easy  to  see  that  the  above  release  policy  will 
accelerate  the  tagged  cell  releases  from  the  regulator  at 
times  when  a  minimum  spacing  of  T  at  the  output  of  the 
scheduler  would  be  violated.  This  acceleration  occurs  when 
Bk<T. 

The  tagged  cell  release  of  the  (equivalent)  static  R&S 
policy  can  be  easily  described  in  terms  of  that  of  the  dy¬ 
namic  R&S  by  replacing  \Vk  by  7\ 

3  Study  of  Regulator  Behavior 

The  behavior  of  the  R&S  schemes  is  evaluated  by  inves¬ 
tigating  their  impact  on  a  specific  stream  (tagged  stream). 
The  traffic  at  the  output  of  the  regulators  associated  with 
the  remaining  N  -  I  applications  is  aggregated  and  forms 
the  background  traffic  which  competes  with  the  tagged 
traffic  for  resources  at  the  scheduler. 

The  improved  performance  of  the  dynamic- R&S  policy 
over  the  static-R&S  policy  is  established  by  evaluating  the 
cell  interdeparture  process  at  the  output  of  the  scheduler. 
Since  the  input  process  to  the  scheduler  is  the  output  pro¬ 
cess  from  the  regulator,  it  is  important  that  the  latter  be 
investigated  to  both  gain  insight  into  the  combined  system 
(regulator  plus  scheduler)  behavior,  as  well  as  evaluate  the 
output  process  at  the  scheduler. 

The  basic  operational  difference  between  the  dynamic- 
R&S  and  static-R&S  schemes  is  captured  by  the  tagged  cell 
interdeparture  process  from  the  regulator  {V*}*>i,  where 
Vk  -  1  -  tk . 

In  order  to  decouple  the  intrinsic  behavior  -  to  be  in¬ 
vestigated  in  this  paper  -  of  the  R&S  schemes  from  the 
source  load,  the  heavy'  traffic  source  assumption  will  be 
made  throughout  the  paper.  This  assumption  is  consis¬ 
tent  with  standard  ones  made  in  order  to  determine  the 
throughput  capabilities  of  a  scheme  as  well  as  the  through¬ 
put  fluctuations  (jitter),  without  the  noise  introduced  by 


source  inactivity  periods.  In  view  of  (2)  and  under  the 
heavy  traffic  source  assumption,  it  is  easy  to  establish  that: 

V*  —  Wh  (dynamic-R&S)  and  V*5  =  T  (static- R&rS)  (3) 

since  the  indicator  function  in  (2)  is  always  zero.  The  prob¬ 
abilistic  description  of  Vj*  may  be  found  in  [11]  along  with 
the  detailed  derivations  involved  in  the  following  proposi¬ 
tion  establishing  the  comparative  behavior  of  the  two  poli¬ 
cies. 

Proposition  1  :  The  maximum  throughput  (output) 

rate  of  the  tagged  regulator  under  the  two  policies  is  given 
by 

R-max  ~  £ |  |  an(^  ^mnz  — 

Notice  that  R^ax  >  equality  only  when  the  back¬ 

ground  traffic  process  can  never  deliver  more  than  T  —  2 
cells  over  T  —  1  consecutive  slots. 

The  above  establish  that  the  dynainic-R&S  scheme  will 
respond  to  a  sudden  increase  of  the  background  load  *  by 
increasing  its  regulator’s  output  rate  above  the  targeted 
rate  of  1  /T,  in  an  effort  to  ensure  that  the  targeted  rate 
at  the  output  of  the  scheduler  is  achieved.  The  impact  of 
such  a  reaction  (which  is  not  possible  under  the  static-R&S 
scheme)  on  the  scheduler  output  process  is  investigated  in 
the  next  section. 


4  Study  of  Scheduler  Behavior 


The  tagged  cell  interdeparture  process  {A'*}jb>i  from 
the  scheduler  has  been  derived  in  order  to  evaluate  the 
throughput  /  jitter  properties  of  the  R&S  schemes.  Since 
the  deliverability  of  the  QoS  of  the  tagged  application 
(which  is  considered  to  be  a  measure  of  the  induced 
throughput  /  jitter)  is  expected  to  be  decreased  under  high 
load  conditions  at  the  scheduler,  such  conditions  -  both 
within  the  stability  and  instability  regions  of  the  system  - 
will  be  considered  at  the  scheduler. 

Under  underload  conditions  (p  <  1)  the  scheduler  queue 
is  stable  and  since  no  cell  overflow  is  possible  (assuming  in¬ 
finite  buffer  at  the  scheduler),  the  following  relationships 
between  the  maximum  regulator  ( Rmax )  and  scheduler 
(Smax)  throughputs  hold: 


D<i  _  qrf 

Amnz  Jmnz  » 


ds  _  C* 

^max  J max 


and 


nd  ■>  e» 
uma  i  —  ‘-'max 


(5) 


The  Probability  Mass  Function  (PMF)  of  A*  has  been 
derived  under  both  policies  by  formulating  appropriate 
queuing  systems,  assuming  infinite  buffer  capacities  at  the 
regulator  and  scheduler  and  heavy  traffic  conditions  at  the 
regulator.  The  derivations  -  under  both  underload  (p  <  1) 
and  overload  (p  >  1)  -  can  be  found  in  [11].  Finally  it  can 
be  shown  that  under  overload  conditions  at  the  scheduler 

2 which  would  result  in  an  instantaneous  reduction  of  the  tagged 
throughput  at  the  output  of  the  scheduler 


the  dynamic-R&S  policy  will  potentially  reduce  the  tagged 
cell  spreading  (while  it  will  never  increases  it)  compared  to 
the  static-R&rS  policy. 


5  Numerical  Results 

The  analytical  results  presented  below*  have  been  ob¬ 
tained  for  two  values  of  the  target  interdeparture  time  T  - 
or  desirable  throughput  1/T  -  equal  to  T  =  5  and  T  =  10. 
The  background  traffic  is  modeled  as  an  independent  per 
slot,  batch  process  with  binoinially  distributed  batch  size  of 
maximum  value  N  —  1  =  8  and  success  probability  pt.  The 
background  load  or  utilization  is  denoted  by  ptack  (Back 
Util). 

Figure  2  presents  the  regulator  throughput  vs  pback 
under  both  policies.  As  ptack  increases,  the  dynamic- 
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Figure  2:  Regulator  Throughput  VS  background  utiliza¬ 
tion  Ptack  (pback  <  !)• 


R&S  policy  can  detect  the  increased  background  intensity 
and  release  cells  earlier,  attempting  to  provide  the  tar¬ 
geted  throughput  (1/T)  and  control  jitter.  As  a  conse¬ 
quence,  the  rate  by  which  the  packets  leave  the  regulator 
increases  as  the  background  intensity  increases.  The  sched¬ 
uler  throughput  Sm(lx  vs  ptack  is  shown  in  Fig.  3  for  both 
policies  under  underload  conditions  (p  <  1).  Smax  is  cal¬ 
culated  as  l/E{Xk}‘  As  expected  from  (5),  S^nx  =  Rmax 
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Figure  3:  Scheduler  throughput  vs  pback 


under  underload  conditions  and  infinite  buffer  capacity  at 
the  scheduler.  Although  continues  to  increase  as 

Pback  increases  (as  said  earlier),  S^nax  starts  deviating  from 
R-max  anc*  declines  beyond  some  value  of  pbnck  equal  to 
about  .78  for  T  =  5.  This  is  due  to  the  fact  that  the  sched¬ 
uler  reaches  an  overload  state  (p  >  1)  and  the  dynamics 
change.  Specifically,  an  infinite  scheduler  queue  backlog 
is  built  up  under  these  conditions,  resulting  in  a  different 
than  Rmax  output  rate  from  the  scheduler,  a s  discussed 
earlier.  Thus,  only  the  results  for  Rmnz  +  Pback  <  1  are 
relevant. 

The  detailed  traffic  smoothness  characteristics  of  the 
two  R&S  schemes  are  shown  in  Fig.  4  for  underload  con¬ 
ditions  at  the  scheduler.  Tagged  cell  clustering  (A'*  <  T) 
is  seen  to  slightly  increase  under  the  dynamic- R&S  pol¬ 
icy,  while  spreading  ( X *  >  T)  is  substantially  reduced  and 
more  probability  mass  is  concentrated  around  T.  While 
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Figure  4:  Jitter  (A'*)  PMF  (pback  <  1) 


the  spreading  reduction  under  the  dynamic- R&S  policy  is 
expected,  the  slight  increase  in  the  clustering  is  less  obvi¬ 
ous.  It  may  be  attributed  to  the  higher  probability  that 
the  scheduler  queue  is  nonempty  under  the  dynarnic-R&S 
policy,  due  to  the  higher  scheduler  load  (Smaz)  resulting 
from  a  higher  regulator  throughput  (Rmax)- 

The  traffic  smoothness  characteristics  of  the  two 
schemes  under  overload  traffic  conditions  at  the  scheduler 
can  be  observed  in  Fig.  5.  The  jitter  PMF  is  highly  con¬ 
tained  around  the  target  value  T  under  the  dynamic-R&S 
scheme,  and  it  is  spread  over  wide  range  of  values  under 
the  static-R&S  scheme. 

Finally,  a  system  in  which  N  =  7  ON-OFF  Markov 
sources  (srco  -srce)  controlled  by  the  R&S  policies  is  con¬ 
sidered  and  is  studied  through  simulationsjthe  parameters 
of  the  geometric  length  of  the  On  and  OFF  periods  are  de¬ 
noted  by  Pon  and  P0fj ,  respectively.  The  objective  of  this 
study  is  to  investigate  whether  the  good  characteristics  of 
the  dynamic  RandS  policy  are  maintained  when  multiple 
sources  are  controlled  by  such  a  policy. 

The  actual  cell  interdeparture  times  (A*)  from  the 
scheduler  were  recorded,  after  filtering  out  the  gaps  caused 
by  the  source’s  OFF  periods,  Thus  the  calculated  through¬ 
put  (  called  the  active  throughput,  Sft)  will  be  higher  than 
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Figure  5:  Jitter  (A'*)  PMF  {pback  >  1) 


src# 

— 

VARlX'l  1 

VAR[Xt) 

0 

0.397 

0.430 

0.435 

3.122 

2.948 

1 

0.383 

0.42G 

0.431 

3.132 

2.980 

2 

0.400 

0.432 

0.437 

3.026 

2.913 

3 

0.043 

0.092 

0.065 

8.262 

48.028 

4 

0.042 

0.090 

0.063 

8.986 

51.963 

5 

0.111 

0.157 

0.124 

11.99 

20.319 

6 

0.111 

0.162 

0.126 

9.825 

17.710 

Table  1:  Simulation  results 


the  actual  average  throughput.  This  filtering  of  the  data 
allows  for  capturing  the  effect  that  the  policies  have  on  the 
sources  while  they  are  active,  without  being  obscured  by 
the  OFF  periods  where  the  policies  are  ineffective. 

Several  simulation  scenarios  were  considered  with  dif¬ 
ferent  system  utilization  factors.  Results  are  shown  here 
for  p  =  1.49  (overload). 

The  results  of  the  simulation  scenario  are  shown  in  Ta¬ 
ble  1;  Pon  —  P0fj  =  .8  for  srco, srci , srci,  P0n  —  «8  and 
P0jj  =  .85  for  src2,src4  and  Pon  =  .8  and  P0jj  =  .75  for 
src5,src6.  The  target  cell  spacings  (Armjn)  enforced  were: 
A"min  =  1  for  srco  -  srco,  Xmin  =  10  for  src3  -  src4  and 
A'min  =  5  for  src 5  —  src$.  Sources  stcq  —  src-j  are  practi¬ 
cally  unregulated  since  they  are  allowed  to  release  cell  as 
soon  as  they  are  generated.  For  each  source,  the  average 
source  rate  A aVc  is  shown  in  table  1,  along  with  the  mea¬ 
sured  active  throughput  S%  and  variance  V A  R[ X%]  under 
the  dynamic-R&S  and  S'  and  variance  VAR[X£\  under 
the  static-R&S  policies. 

The  positive  impact  of  the  dynamic- R&S  policy  on  the 
regulated  sources  (src3,  src^,  srcs,  src6)  is  clearly  observed 
in  Table  1:  the  variance  of  the  interdeparture  process  un¬ 
der  the  dynarnic-R&S  scheme  (V AR[X%})  is  dramatically 
smaller  than  the  variance  under  the  static-R&S  scheme 
(V  AR[Xl\).  At  the  same  time  the  S'*  is  higher  than  the  S'. 
This  quantifies  the  ability  of  the  dynamic-R&S  scheme  to 
control  the  delay  jitter  better  than  the  static-R&S  scheme, 
for  all  of  the  regulated  sources.  The  dynamic-R&S  scheme 


manages  to  serve  the  regulated  sources  with  a  less  variable 
service  rate  and,  as  a  result,  the  traffic  maintains  better  the 
desirable  characteristics  at  the  output  of  the  scheduler. 

The  unregulated  sources  (src^,srci,src2)  experience  a 
slightly  higher  V AR[X$\.  Since  the  target  Arm;n  for  these 
sources  is  1,  the  two  policies  are  basically  ineffective  and 
the  variance  and  active  throughput  are  shaped  solely  by 
the  background  interfering  process  seen  by  each  of  these 
sources  during  each  slot.  The  background  interfering  pro¬ 
cesses  are  different  under  the  two  policies.  It  is  expected 
that  under  the  dynamic-R&S  policy  more  cells  will  be  re¬ 
leased  to  the  scheduler  per  slot  during  the  ON  period,  due 
to  the  early  releases. 

The  above  results  can  be  viewed  graphically  in  terms 
of  the  empirical  PMFs  shown  in  Fig.  6  and  Fig.  7.  The 
PMF  at  the  target  value  is  substantially  higher  under  the 
dynarnic-R&S  scheme.  This  may  be  attributed  to  the  re¬ 
duced"  spreading.  The  empirical  PMFs  for  srco,src\ ,  src> 
are  almost  identical  under  the  two  policies  and  they  are 
not  shown. 


Figure  6:  Empirical  Jitter  PMFs  :  srcz  and  src^ 


derload  and  overload  conditions  -  than  the  static-R&S 
scheme. 
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6  Conclusion 

In  this  paper  a  dynamic  Regulation  and  Scheduling 
(R&rS)*  scheme  has  been  proposed  and  studied  through 
analysis  and  simulation.  The  analytical  and  simulation 
results  clearly  have  showm  that  the  dynamic-R&S  can  pro¬ 
vide  substantially  better  jitter  control  -  both  under  un¬ 
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